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PRODUCTION OF VERY LONG CHAIN POLYUNSATURATED FATTY ACIDS IN 

OILSEED PLANTS 
This application claims the benefit of U.S. Provisional Application 
5 No. 60/446.941. filed February 12, 2003. the disclosure of which is hereby 
incorporated by reference in its entirety. 

FIELD OF THE INVENTION 
This invention pertains to oilseed plants which have been transformed to 
produce very long chain polyunsaturated fatty acids and to recombinant constructs 
10 and method for producing such fatty acids in a plant. 

BACKGROUND OF THE INVENTION 
Lipids/fatty acids are water-insoluble organic biomolecules that can be 
extracted from cells and tissues by nonpolar solvents such as chloroform, ether or 
benzene. Lipids have several important biological functions, serving (1) as 
15 structural components of membranes, (2) as storage and transport forms of 

metabolic fuel, (3) as a protective coating on the surface of many organisms, and 
(4) as celll-surface components concerned in cell recognition, species specificity 
and tissue immunity. 

The human body is capable of producing most of the fatty acids which' it 
20 requires to function. Two long chain polyunsaturated fatty acids, eicosapentaenoic 
acid (EPA) and docosahexaenoic acid (DHA), however, cannot be synthesized 
efficiently by the human body and, thus, have to be supplied through the diet. Since 
the human body cannot produce adequate quantities of these polyunsaturated fatty 
acids, they are called essential fatty acids. 
25 PUFAs are important components of the plasma membrane of the cell, where 

they may be found in such forms as phospholipids and also can be found in 
triglycerides. PUFAs also serve as precursors to other molecules of importance in 
human beings and animals, including the prostacyclins, leukotrienes and 
prostaglandins. There are two main families of polyunsaturated fatty acids (PUFAs), 
30 specifically, the omega-3 fatty acids and the omega-6 fatty acids. 

DHA is a fatty acid of the omega-3 series according to the location of the last 
double bond in the methyl end. It is synthesized via alternating steps of 
desaturation and elongation. Production of DHA is important because of its 
beneficial effect on human health. Currently the major sources of DHA are oils from 
35 fish and algae. 

EPA and arachidonic acid (AA) are both delta-5 essential fatty acids. EPA 
belongs to the omega-3 series with five double bonds in the acyl chain, is found in 
marine food, and is abundant in oily fish from the North Atlantic. AA belongs to the 
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omega-6 series with four double bonds. The lack of a double bond in the omega-3 
position confers on AA different properties than those found in EPA. The 
eicosanoids produced from AA have strong inflammatory and platelet aggregating 
properties, whereas those derived from EPA have anti-inflammatory and anti- 
platelet aggregating properties, AA can be obtained from some foods such as meat, 
fish, and eggs, but the concentration is low. 

Gamma-linolenic acid (GI_A) is another essential fatty acid found in 
mammals. GLA is the metabolic intermediate for very long chain omega-6 fatty 
acids and for various active molecules. In mammals, formation of long chain PUFAs 
is rate-limited by delta-6 desaturation. Many physiological and pathological 
conditions such as aging, stress, diabetes, eczema, and some infections have been 
shown to depress the delta-6 desaturation step. In addition, GI_A is readily 
catabolized from the oxidation and rapid cell division associated with certain 
disorders, e.g., cancer or inflammation. 

Research has shown that omega-3 fatty acids reduce the risk of heart 
disease as well as having a positive effect on children's development. Results have 
been disclosed indicating the positive effect of these fatty acids on certain mental 
illnesses, autoimmune diseases and joint complaints. Thus, there are many health 
benefits associated with a diet supplemented with these fatty acids. 

Unfortunately, there are several disadvantages associated with commercial 
production of PUFAs from natural sources. Natural sources of PUFAs. such as 
animals and plants, tend to have highly heterogeneous oil compositions. The oils 
obtained from these sources can require extensive purification to separate out one 
or more desired PUFAs or to produce an oil which is enriched in one or more 
PUFAs. Natural sources also are subject to uncontrollable fluctuations in 
availability. Fish stocks may undergo natural variation or may be depleted by 
overfishing. Fish oils have unpleasant tastes and odors which may be difficult, if not 
impossible, to economically separate from the desired product, and can render such 
products unacceptable as food supplements. Animal oils and, in particular, fish oils, 
can accumulate envrionmental pollutants. Weather and disease can cause 
fluctuation in yields from both fish and plant sources. 

An expansive supply of polyunsaturated fatty acids from natural sources and 
from chemical syntheis are not sufficient for commercial needs. Therefore, it is of 
interest to find alternative means to allow production of commercial quantities of 
PUFAs. Biotechnology offers an attractive route for producing LCPUFAs in a safe, 
cost efficient manner. 

WO 02/26946, published April 4, 2002, describes isolated nucleic acid 
molecules encoding FAD4, FADS, FAD5-2 and FAD6 fatty acid desaturase family 

2 



I ■ 



wo 2004/071467 PCTAJS2004/005758 

members which are expressed in LCPUFA-producing organisms, e.g., 
Thraustochytrium, Pythium irregulare, Schizichytrium and Crypthecodinium. It is 
indicated that constructs containing the desaturase genes can be used in any 
expression system including plants, animals, and microorganisms for the production 
5 of cells capable of producing LCPUFAs. 

WO 02/26946, published April 4, 2002. describes FAD4, FADS, FAD5-2. and 
FAD6 fatly acid desaturase members and uses thereof to produce long chain 
polyunsaturated fatty acids. 

WO 98/55625, published December 19, 1998, describes the production of 
10 polyunsaturated fatty acids by expression of polyketide-like synthesis genes in 
plants. 

WO 98/46764, published October 22. 1998, describes compositions and 
methods for preparing long chain fatty acids in plants, plant parts and plant cells 
which utilize nucleic acid sequences and constructs encoding fatty acid 
15 desaturases. including delta-5 desaturases, delta-6 desaturases and delta-12 
desatu rases. 

U.S. Patent No. 6.075.183, issued to Knutzon et al. on June 13. 2000. 
describes methods and compositions for synthesis of long chain polyunsaturated 
fatty acids in plants. 

20 U.S. Patent No. 6,459,018, issued to Knutzon on October 1, 2002. describes 

a method for producing stearidonic acid in plant seed utilizing a construct 

comprising a DNA sequence encoding a delta-six desaturase. 

Spychalla et al., Proc. NatL Acad. ScL USA, Vol.94, 1 142-1 147 (Feb. 1997), 

describes the isolation and characterization of a cDNA from C. elegans that, when 
25 expressed in Arabidopsis, encodes a fatty acid desaturase which can catalyze the 

introduction of an omega-3 double bond into a range of 18- and 20-carbon fatty 

acids. 

SUMMARY OF THE INVENTION 
The invention includes an oilseed plant that produces mature seeds in which 
30 the total seed fatty acid profile comprises at least 1.0| % of at least one 

polyunsaturated fatty acid having at least twenty carbon atoms and five or more 
carbon-carbon double bonds. 

In a second embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
35 5.0 % of at least one polyunsaturated fatty acid having at least twenty carbon atoms 
and five or more carbon-carbon double bonds. 

In a third embodiment, this invention includes an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 10.0 % of 
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at least one polyunsaturated fatty acid having at least twenty carbon atoms and five 
or more carbon-carbon double bonds. 

Additional embodiments of this invention Include an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
5 15.0 %, 20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated 
fatty acid having at least twenty carbon atoms and five or more carbon-carbon 
double bonds. 

In a fourth embodiment this invention includes an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 10.0 % of 

10 at least one polyunsaturated fatty acid having at least twenty carbon atoms and five 
or more carbon-carbon double bonds and less than 2.0% arachidonic acid. 

Again additional embodiments would include an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 15.0 %, 
20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated fatty acid 

15 having at least twenty carbon atoms and five or more carbon-carbon double bonds 
and less than 2.0 % arachidonic acid. 

The PUFA can be an omega-3 fatty acid selected from the group consisting 
of EPA, DPA and DHA. 

Also of interest are seeds obtained from such plants and oil obtained from the 

20 seeds of such plants. 

In a fifth embodiment, this invention includes a recombinant construct for 
altering the total fatty acid profile of mature seeds of an oilseed plant, said construct 
comprising at least two promoters wherein each promoter is operably linked to a 
nucleic acid sequence encoding a polypeptide required for making at least 

25 one polyunsaturated fatty acid having at least twenty carbon atoms and four or more 
carbon-carbon double bonds and further wherein the total fatty acid profile 
comprises at least 2 % of at least one polyunsaturated fatty acid having at least 
twenty carbon atoms and four or more carbon-carbon double bonds and further 
wherein said polypeptide is an enzyme selected from the group consisting of a A4 

30 desaturase, a A5 desaturase, a A6 desaturase, a A15 desaturase, a A17 
desaturase, a C18 to C22 elongase and a C20 to C24 elongase. 

In a further aspect, the promoter is selected from the group consisting of the 
alpha prime subunit of beta conglycinin promoter, Kunitz trypsin inhibitor 3 promoter, 
annexin promoter, Glyl promoter, beta subunit of beta conglycinin promoter, 

35 P34/Gly Bd m 30K promoter, albumin promoter. Leg A1 promoter and Leg A2 
promoter. Also of interests are oilseed plants comprising in their genome such 
recombinant constructs, seeds obtained from such plants and oil obtained from the 
seeds of such plants. 

4 
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In a sixth embodiment, this invention includes a method for making an 
oilseed plant having an altered fatty acid profile which comprises: 

a) transforming a plant with the recombinant construct of the fifth 
embodiment; 

5 b) growing the transformed plant of step (a); and 

c) selecting those plants wherein the total fatty acid profile comprises at 
least 1.0 % of at least one polyunsaturated fatty acid having at least 
twenty carbon atoms and five or more carbon-carbon double bonds. 
In a seventh embodiment, this invention includes a method for making an oilseed 
10 plant having an altered fatty acid profile which comprises: 

a) transforming a plant with the recombinant construct of the fifth 
embodiment including any one of the promoters recited therein, 

b) growing the transformed plant of step (a); and 

c) selecting those plants wherein the total fatty acid profile comprises at 
15 least 1.0 % of at least one polyunsaturated fatty acid having at least 

twenty carbon atoms and five or more carbon-carbon double bonds. 
Also of interest are oilseed plants made by such methods, seeds obtained 
from such plants and oil obtained from the seeds of such plants. 

In an eighth embodiment, this invention includes a food product, beverage, 
20 infant formula, or nutritional supplement incorporafing any of the oils of the 
invention. 

In a ninth embodiment, this invention includes a food product, pet food or 
animal feed which has incorporated therein any of the seeds of the invention. 

In a tenth embodiment, this invention includes an oilseed plant that produces 

25 mature seeds in which the total seed fatty acid profile comprises polyunsaturated 
fatty acids having at least twenty carbon atoms and five or more carbon-carbon 
double bonds wherein the ratio of EPAiDHA is in the range from 1:100 to 860:100. 
The oilseed plant may further have a total seed fatty acid profile comprising less 
than 2.0% arachidonic acid. Also of interest are seeds obtained from such plants 

30 and oil obtained from the seeds of such plants. 

In an eleventh embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises 
polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of DHA:EPA is in the range from 

35 1 :100 to 1 10:100. The oilseed plant may further have a total seed fatty acid profile 
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 
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BIOLOGICAL DEPOSITS 
The following plasmids have been deposited with the American Type Culture 
Collection (ATCC), 10801 University Boulevard, Manassas, VA 20110-2209, and 



bears the following designation, accession number and date of deposit. 


Plasmid 


Accession Number 


Date of Deposit 


pKR274 


ATCC PTA-4988 


Jan. 30. 2003 


pKR275 


ATCC PTA-4989 


Jan. 30, 2003 


pKR357 


ATCC PTA-4990 


Jan. 30, 2003 


pKR365 


ATCC PTA-4991 


Jan. 30, 2003 


pKKE2 


ATCC PTA-4987 


Jan. 30. 2003 



BRIEF DESCRIPTION OF THE 
DRAWINGS AND SEQUENCE LISTINGS 

The invention can be more fully understood from the following detailed 
description and the accompanying drawinjgs and Sequence Listing, which form a 
part of this application. 

The sequence descriptions summarize the Sequences Listing attached 
hereto. The Sequence Listing contains one letter codes for nucleotide sequence 
characters and the single and three letter codes for amino acids as defined in the 
lUPAC-lUB standards described in Nucleic Acids Researvii 73:3021-3030 (1985) 
and in the BiocliemicalJoumal 219 (No. 2;:345-373 (1984). 

Figure 1 shows possible biosynthetic pathways for PUFAs. 

Figure 2 shows possible pathways for production of LC-PUFAs included EPA 
and DHA compiled from a variety of organisms. 

Figure 3 is a schematic depiction of plasmid pKR274. 

Figure 4 is a schematic depiction of plasmid pKKE2. 

Figure 5 is a schematic depiction of plasmid pKR275. 

Figure 6 is a schematic depiction of plasmid pKR365. 

Figure 7 is a schematic depiction of plasmid pl^R364. 

Figure 8 is a schematic depiction of plasmid pKR357. 

SEQ. ID. NO:1 sets forth oligonucleotide primer GSP1 used to amplify the 
soybean annexin promoter. 

SEQ. ID. NO:2 sets forth oligonucleotide primer GSP2 used to amplify the 
soybean annexin promoter. 

SEQ. ID. NO:3 sets forth the sequence of the annexin promoter. 

SEQ. ID. NO:4 sets forth oligonucleotide primer GSP3 used to amplify the 
soybean BD30 promoter. 
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SEQ. ID. NO:5 sets forth oligonucleotide primer GSP4 used to amplify the 
soybean BD30 promoter. 

SEQ. ID. NO:6 sets forth the sequence of the soybean BD30 promoter. 

SEQ. ID. NO:7 sets forth the sequence of the soybean p-coriglycinin P- 
5 subunit promoter. 

SEQ. ID. NO:8 sets forth oligonucleotide primer p-con oligo Bam used to 
amplify the promoter for soybean p-conglycinin p-subunit. V 

SEQ. ID. NO:9 sets forth oligonucleotide primer p-con oligo Not used to 
amplify the promoter for soybean p-conglycinin p-subunit. 
10 SEQ. ID. NO:10 sets forth the sequence of the soybean giycinin Gly-1 

promoter. 

SEQ. ID. NO:1 1 sets forth oligonucleotide primer glyoligo Bam used to 
amplify the Gly-1 promoter. 

SEQ. ID. NO:12 sets forth oligonucleotide primer glyoligo Not used to amplify 
15 the Gly-1 promoter. 

SEQ. ID. NO: 13 sets forth oligonucleotide primer oCGR5-1. 

SEQ. ID. NO:14 sets forth oligonucleotide primer oCGR5-2. 

SEQ. ID. NO: 15 sets forth oligonucleotide primer oSAIb-9. 

SEQ. ID. NO: 16 sets forth oligonucleotide primer oSAib-3. 
20 SEQ. ID. NO: 17 sets forth oligonucleotide primer oSAIb-4. 

SEQ. ID. NO:18 sets forth oligonucleotide primer oSAIb-2. 

SEQ. ID. NO: 19 sets forth oligonucleotide primer LegPro5'. 

SEQ, ID. NO:20 sets forth oligonucleotide primer LegProS'. 

SEQ. ID. NO:21 sets forth oligonucleotide primer LegTerm5'. 
25 SEQ. ID. NO:22 sets forth oligonucleotide primer LegTerm3\ 

SEQ. ID. NO:23 sets forth oligonucleotide primer oKTi5. 

SEQ. ID. NO:24 sets forth oligonucleotide primer oKTiS. 

SEQ. ID. NO:25 sets forth oligonucleotide primer LegA1Pro5\ 

SEQ. ID. NO:26 sets forth oligonucleotide primer LegA1Pro3'. 
30 SEQ. ID. NO:27 sets forth oligonucleotide primer LegAITermS'. 

SEQ. ID. NO:28 sets forth oligonucleotide prirher LegAITermS'. 

SEQ. ID. NO:29 sets forth oligonucleotide primer annreampS'. 

SEQ. ID. NO:30 sets forth oligonucleotide primer annreamp3'. 

SEQ. ID. NO:31 sets forth oligonucleotide primer BD30 reamp5\ 
35 SEQ. ID. NO:32 sets forth oligonucleotide primer BD30 reamp3'. 

SEQ. ID. NO:33 sets forth the sequence of the gene for Mortierella alpina 
delta-6 desaturase. 
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SEQ. ID. NO:34 sets forth the protein sequence of the Mortierella alpina 
delta-6 desaturase. 

SEQ. ID. NO:35 sets forth the sequence of the gene for Saprolegnia diclina 
delta-6 desaturase. 

SEQ. ID. NO:36 sets forth the protein sequence of the Saprolegnia diclina 
delta-6 desaturase. 

SEQ. ID. NO:37 sets forth the sequence of the gene for Saprolegnia diclina 
delta-5 desaturase. 

SEQ. ID. NO:38 sets forth the protein sequence of the Saprolegnia diclina 
delta-5 desaturase. 

SEQ. ID. NO:39 sets forth the sequence of the gene for Thraustochytrium 
aureum elongase. 

SEQ. ID. NO:40 sets forth the protein sequence of the Thraustochytrium 
aureum elongase. 

SEQ. ID. NO:41 sets forth the sequence of the gene for Saprolegnia diclina 
delta-17 desaturase. 

SEQ. ID. NO:42 sets forth the protein sequence of the Saprolegnia diclina 
delta-17 desaturase. 

SEQ. ID. NO:43 sets forth the sequence of the gene for Mortierella alpina 
elongase. 

SEQ. ID. NO:44 sets forth the protein sequence of the Mortierella alpina 
elongase. 

SEQ. ID. NO:45 sets forth the sequence of the gene for Mortierella alpina 
delta-5 desaturase. 

SEQ. ID. NO:46 sets forth the protein sequence of the Mortierella alpina 
delta-5 desaturase. 

SEQ. ID. NO:47 sets forth the sequence of At FAD3, the gene for Arabidopsis 
thaliana delta-15 desaturase. 

SEQ. ID. NO:48 sets forth the protein sequence of the Arabidopsis thaliana 
delta-15 desaturase. 

SEQ. ID. NO:49 sets forth the sequence of the gene for Pavlova sp. 
elongase. 

SEQ. ID. NO:50 sets forth the protein sequence of the Pavlova sp. elongase. 

SEQ. ID. NO:51 sets forth the sequence of the gene for Schizochytrium 
aggregatum delta-4 desaturase. 

SEQ. ID. NO:52 sets forth the protein sequence of the Schizochytrium 
aggregatum delta-4 desaturase. 

SEQ. ID. NO:53 sets forth oligonucleotide primer RSP19F, 

8 
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SEQ. 


ID. 


NO:54 sets forth oligonucleotide primer RSP19R. 




SEQ. 


ID. 


NO:55 sets forth oligonucleotide primer RBP2F. 




SEQ. 


ID. 


NO:56 sets forth oligonucleotide primer RBP2R. 




SEQ. 


ID. 


NO:57 sets forth oligonucleotide primer CGR4F. 


5 


SEQ. 


ID. 


NO:58 sets forth oligonucleotide primer CGR4R. 




SEQ. 


ID. 


NO:59 sets forth oligonucleotide primer oSGIy-1 . 




SEQ. 


ID. 


NO:60 sets forth oligonucleotide primer oSGIy-2. 




SEQ. 


ID. 


NO:61 sets forth consensus desaturase Protein Motif 1 . 




SEQ. 


ID. 


NO:62 sets forth oligonucleotide primer R01 144. 


10 


SEQ. 


ID. 


NO:63 sets forth consensus desaturase Protein Motif 2. 




SEQ. 


ID. 


NO:64 sets forth oligonucleotide primer R01119. 




SEQ. 


ID. 


NO:65 sets forth oligonucleotide primer R01 118. 




SEQ. 


ID. 


NO:66 sets forth consensus desaturase Protein Motif 3. 




SEQ. 


ID. 


NO:67 sets forth oligonucleotide primer R01121. 


15 


SEQ. 


ID. 


NO:68 sets forth oligonucleotide primer R01122. 




SEQ. 


ID. 


NO:69 sets forth consensus desaturase Protein Motif 4. 


■ 


SEQ. 


ID. 


NO:70 sets forth oligonucleotide primer R01146. 




SEQ. 


ID. 


NO:71 sets forth oligonucleotide primer R01 147. 




SEQ. 


ID. 


NO:72 sets forth consensus desaturase Protein Motif 5. 


20 


SEQ. 


ID. 


NO:73 sets forth oligonucleotide primer R01 148. 




SEQ. 


ID. 


NO:74 sets forth consensus desaturase Protein Motif 6. 




SEQ. 


ID. 


NO:75 sets forth oligonucleotide primer R01 114. 




SEQ. 


ID. 


NO:76 sets forth consensus desaturase Protein Motif 7, 




SEQ. 


ID. 


NO:77 sets forth oligonucleotide primer R01 116. 


25 


SEQ. 


ID. 


NO:78 sets forth consensus desaturase Protein Motif 8. 




SEQ. 


ID. 


NO:80 sets forth oligonucleotide primer R01189. 




SEQ. 


ID. 


NO:81 sets forth oligonucleotide primer R01 190. 




SEQ. 


ID. 


NO:82 sets forth oligonucleotide primer R01191. 




SEQ. 


ID. 


NO:83 sets forth oligonucleotide primer R0898. 


30 


SEQ. 


ID. 


NO:84 sets forth oligonucleotide prirrier R0899. 




SEQ. 


ID. 


NO:85 sets forth oligonucleotide prirner R01 1 85. 




SEQ. 


ID. 


NO:86 sets forth oligonucleotide primer R01 186. 




SEQ. 


ID. 


NO:87 sets forth oligonucleotide primer R01187. 




SEQ. 


ID. 


NO:88 sets forth oligonucleotide primer R01212. 


35 


SEQ. 


ID. 


NO:89 sets forth oligonucleotide primer R01213. 




SEQ. 


ID. 


NO:90 sets forth the sequence of the expression cassette that 



comprises the constitutive soybean S-adenosylmethionine synthetase (SAMS) 
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promoter operably linked to a gene for a form of soybean acetolactate synthase 
(ALS) that is capable of conferring resistance to sulfonylurea herbicides. 
SEQ. ID. NO:91 sets forth oligonucleotide primer oSBD30-1. 
SEQ. ID. NO:92 sets forth oligonucleotide primer oSBD30-2, 
5 SEQ. ID. NO:93 sets forth oligonucleotide primer T7pro. 

SEQ. ID. NO:94 sets forth oligonucleotide primer R01327. 
SEQ. ID. NO:95 sets forth oligonucleotide primer GenRacer3'. 
SEQ. ID. NO:96 sets forth oligonucleotide primer oCal-26. 
SEQ. ID. NO:97 sets forth oligonucleotide primer oCal-27. 
10 SEQ. ID. NO:98 sets forth oligonucleotide primer oKTi7. 

DETAILED DESCRIPTION OF THE INVENTION 
All patents, patent applications, and publications cited are incorporated herein 
by reference in their entirety. 

In the context of this disclosure, a number of terms shall be utilized. 
15 Fatty acids are described herein by a numbering system in which the number 

before the colon indicates the number of carbon atoms in the fatty acid, whereas the 
number after the colon is the number of double bonds that are present. The number 
following the fatty acid designation indicates the position of the double bond,from 
the carbocyl end of the fatty acid with the "c" affix for the cis-configuration of the 
20 double bond, e.g., palmitic acid (16:0), stearic acid (18:0), oleic acid (18:1,9c), 
petroselinic acid (18:1. 6c), linoleic acid (18:2,9c,12c), y-linolenic acid (18:3, 
6c,9c,12c) and a-linolenic acid (18:3, 9c,12c,15c). Unless othenwise specified 18:1, 
18:2 and 18:3 refer to oleic, linoleic and linolenic fatty acids. 

"Omega-3 fatty acid" (also referred to as an n-3 fatty acid) includes the 
25 essential fatty acid a-linolenic acid (18:3n-3) (ALA) and its long-chain metabolites. 
In n-3 fatty acids, the first double bond is located at the third carbon from the methyl 
end of the hydrocarbon chain. For n-6 fatty acids, it is located at the sixth carbon. 
Eicosapentaneoic acid (EPA), docosapentaenoic acid (DPA), and 
docosahexanenoic acid (DHA) are examples of omega-3 fatty acids. 
30 "Desaturase" is a polypeptide which can desaturate one or more fatty acids to 

produce a mono- or poly-unsaturated fatty acid or precursor which is of interest. 

A "food analog" is a food-like product manufactured to resemble its food 
counterpart, whether meat, cheese, milk or the like, and is intended to have the 
appearance, taste, and texture of its counterpart. 
35 "Aquaculture feed" refers to feed used in aquafarming which concerns the 

propagation, cultivation or farming of aquatic organisms, animals and/or plants in 
fresh or marine waters, 

10 
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The terms "polynucleotide", "polynucleotide sequence", "nucleic acid 
sequence", and "nucleic acid fragment7"isolated nucleic acid fragment" are used 
interchangeably herein. These terms encompass nucleotide sequences and the 
like. A polynucleotide may be a polymer of RNA or DNA that Is single- or double- 
5 stranded, that optionally contains synthetic, non-natural or altered nucleotide bases. 
A polynucleotide in the form of a polymer of DNA may be comprised of one or more 
segments of cDNA, genomic DNA, synthetic DNA, or mixtures thereof. Nucleotides 
(usually found in their 5 -monophosphate form) are referred to by a single letter 
designation as follows: "A" for adenylate or deoxyadenylate (for RNA or DNA, 

10 respectively), "C" for cytidylate or deoxycytidylate, "G" for guanylate or 

deoxyguanylate, "U" for uridylate, "T" for deoxythymidylate, "R" for purines (A or G), 
"Y" for pyrlmidines (C or T), "K" for G or T, "H" for A or C or T, "I" for inosine, and "N" 
for any nucleotide. 

The terms "subfragment that is functionally equivalent" and "functionally 

15 equivalent subfragment" are used interchangeably herein. These terms refer to a 
portion or subsequence of an isolated nucleic acid fragment in which the ability to 
alter gene expression or produce a certain phenotype is retained whether or not the 
fragment or subfragment encodes an active enzyme. For example, the fragment or 
subfragment can be used In the design of chimeric genes to produce the desired 

20 phenotype in a transformed plant. Chimeric genes can be designed for use in 

suppression by linking a nucleic acid fragment or subfragment thereof, whether or 
not it encodes an active enzyme, in the sense or antisense orientation relative to a 
plant promoter sequence. 

The terms "homology", "homologous", "substantially similar" and 

25 "corresponding substantially" are used interchangeably herein. They refer to nucleic 
acid fragments wherein changes in one or more nucleotide bases do not affect the 
"ability of theTTucleic acid fragment to mediate^efre~expres'sion or produce a certain 
phenotype. These terms also refer to modifications of the nucleic acid fragments of 
the instant invention such as deletion or insertion of one or more nucleotides that do 

30 not substantially alter the functional properties of the resulting nucleic acid fragment 
relative to the initial, unmodified fragment. It is therefore understood, as those 
skilled in the art will appreciate, that the invention encompasses more than the 
specific exemplary sequences. 

Moreover, the skilled artisan recognizes that substantiially similar nucleic acid 

35 sequences encompassed by this invention are also defined by their ability to 

hybridize, under moderately stringent conditions (for example, 0.5 X SSC, 0.1% 
SDS, 60**C) with the sequences exemplified herein, or to any portion of the 
nucleotide sequences disclosed herein and which are functionally equivalent to any 

11 
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of the nucleic acid sequences disclosed herein. Stringency conditions can be 
adjusted to screen for moderately similar fragments, such as homologous 
sequences from distantly related organisms, to highly similar fragments, such as 
genes that duplicate functional enzymes from closely related organisms. Post- 
5 hybridization washes determine stringency conditions. One set of preferred 

conditions involves a series of washes starting with 6X SSC, 0.5% SDS at room 
temperature for 15 min, then repeated with 2X SSC, 0.5% SDS at 45X for 30 min, 
and then repeated twice with 0.2X SSC, 0.5% SDS at 50°C for 30 min. A more 
preferred set of stringent conditions involves the use of higher temperatures in 

1 0 which the washes are identical to those above except for the temperature of the final 
two 30 min washes in 0.2X SSC, 0.5% SDS was increased to SO^C. Another 
preferred set of highly stringent conditions involves the use of two final washes in 
0.1X SSC, 0.1% SDS at 65°C. 

"Gene" refers to a nucleic acid fragment that expresses a specific protein, 

15 including regulatory sequences preceding (5' non-coding sequences) and following 
(3' non-coding sequences) the coding sequence. "Native gene" refers to a gene as 
found in nature with itis own regulatory sequences. "Chimeric gene" refers any gene 
that is not a native gene, comprising regulatory and coding sequences that are not 
found together in nature. Accordingly, a chimeric gene may comprise regulatory 

20 sequences and coding sequences that are derived from different sources, or 

regulatory sequences and coding sequences derived from the same source, but 
arranged in a manner different than that found In nature. A "foreign" gene refers to 
a gene not normally found in the host organism, but that is introduced into the host 
organism by gene transfer. Foreign genes can comprise native genes inserted into 

25 a non-native organism, or chimeric genes. A "transgene" is a gene that has been 
introduced into the genome by a transformation procedure. An "allele" is one of 
several alternative forms of a gene occupying a given locus on a chromosome. 
When all the alleles present at a given locus on a chromosome are the same that 
plant is homozygous at that locus. If the alleles present at a given locus on a 

30 chromosome differ that plant is heterozygous at that|locus. 

"Coding sequence" refers to a DNA sequence that codes for a specific amino 
acid sequence. "Regulatory sequences" refer to nucleotide sequences located 
upstream (5' non-coding sequences), within, or downstream (3' non-coding 
sequences) of a coding sequence, and which influence the transcription, RNA 

35 processing or stability, or translation of the associated coding sequence. Regulatory 
sequences may include, but are not limited to, promoters, translation leader 
sequences, introns, and polyadenylation recognition sequences. 

12 
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"Promoter" refers to a DNA sequence capable of controlling the expression of 
a coding sequence or functional RNA. The promoter sequence consists of proximal 
and more distal upstream elements, the latter elements often referred to as 
enhancers. Accordingly, an "enhancer" is a DNA sequence that can stimulate 
5 promoter activity, and may be an innate element of the promoter or a heterologous 
element inserted to enhance the level or tissue-specificity of a promoter. Promoters 
may be derived in their entirety from a native gene, or be composed of different 
elements derived from different promoters found in nature, or even comprise 
synthetic DNA segments. It is understood by those skilled in the art that different 

10 promoters may direct the expression of a gene in different tissues or cell types, or at 
different stages of development, or in response to different environmental 
conditions. It is further recognized that since in most cases the exact boundaries of 
regulatory sequences have not been completely defined, DNA fragments of some 
variation may have identical promoter activity. Promoters that cause a gene to be 

15 expressed in most cell types at most times are commonly referred to as "constitutive 
promoters". New promoters of various types useful in plant cells are constantly 
being discovered; numerous examples may be found in the compilation by 
Okamuro, J. K., and Goldberg. R. B. (1989) Biochemistry of Plants A5'A-B2. 
The "translation leader sequence" refers to a polynucleotide sequence 

2t) located between the promoter sequence of a gene and the coding sequence. The 
translation leader sequence is present in the fully processed mRNA upstream of the 
translation start sequence. The translation leader sequence may affect processing 
of the primary transcript to mRNA, mRNA stability or translation efficiency. 
Examples of translation leader sequences have been described (Turner, R, and 

25 Foster. G. D. (1995) MoL BiotectinoL 3:225-236). 

The "3' non-coding sequences" or "transcription terminator/termination 
sequences" refer to DNA sequences located downstream of a coding sequence and 
include polyadenylation recognition sequences and other sequences encoding 
regulatory signals capable of affecting mRNA processing or gene expression. The 

30 polyadenylation signal is usually characterized by affecting the addition of 

polyadenylic acid tracts to the 3' end of the mRNA precursor. The use of different 
3' non-coding sequences is exemplified by Ingelbrecht, I. L., et al. (1989) Plant Cell 
1:671-680. 

"RNA transcript" refers to the product resulting from RNA polymerase- 
35 catalyzed transcription of a DNA sequence. When the RNA transcript is a perfect 
complementary copy of the DNA sequence, it is referred to as the primary transcript. 
An RNA transcript is referred to as the mature RNA when it is an RNA sequence 
derived from post-transcriptional processing of the primary transcript. "Messenger 
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RNA (mRNA)" refers to the RNA that is without introns and that can be translated 
into protein by the cell. "cDNA" refers to a DNA that is complementary to and 
synthesized from a mRNA template using the enzyme reverse transcriptase. The 
cDNA can be single-stranded or converted into the double-stranded form using the 
5 Klenow fragment of DNA polymerase I. "Sense" RNA refers to RNA transcript that 
includes the mRNA and can be translated into protein within a cell or in vitro. 
"Antisense RNA" refers to an RNA transcript that is complementary to all or part of a 
target primary transcript or mRNA, and that blocks the expression of a target gene 
(U.S. Patent No. 5,107,065). The complementarity of an antisense RNA may be 

10 with any part of the specific gene transcript, i.e., at the 5' non-coding sequence, 
3' non-coding sequence, introns, or the coding sequence. "Functional RNA" refers 
to antisense RNA, ribozyme RNA, or other RNA that may not be translated but yet 
has an effect on cellular processes. The terms "complement" and "reverse 
complement" are used interchangeably herein with respect to mRNA transcripts, 

15 and are meant to define the antisense RNA of the message. 

The term "operably linked" refers to the association of nucleic acid sequences 
on a single nucleic acid fragment so that the function of one is regulated by the 
other. For example, a promoter Is operably linked with a coding sequence when it is 
capable of regulating the expression of that coding sequence (i.e., that the coding 

20 sequence is under the transcriptional control of the promoter). Coding sequences 
can be operably linked to regulatory sequences in a sense or antisense orientation. 
In another example, the complementary RNA regions of the invention can be 
operably linked, either directly or indirectly, 5' to the target mRNA, or 3' to the target 
mRNA, or within the target mRNA, or a first complementary region is 5' and Its 

25 complement is 3' to the target mRNA. 

Standard recombinant DNA and molecular cloning techniques used herein 
are well known in the art and are described more fully in Sambrook, J., Fritsch, E.F. 
and Maniatis, T. Molecular Cloning: A Laboratory Manual; Cold Spring Harbor 
Laboratory Press: Cold Spring Harbor, 1989. Transformation methods are well 

30 known to those skilled in the art and are described below. 

"PCR" or "Polymerase Chain Reaction" is a technique for the synthesis of 
large quantities of specific DNA segments, consists of a series of repetitive cycles 
(Perkin Elmer Cetus Instruments, Nonwalk, CT). Typically, the double stranded 
DNA is heat denatured, the two primers complementary to the 3' boundaries of the 

35 target segment are annealed at low temperature and then extended at an 

intermediate temperature. One set of these three consecutive steps is referred to 
as a cycle. 
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The term "recombinant" refers to an artificial combination of two othenwise 
separated segments of sequence, e.g., by chemical synthesis or by the 
manipulation of isolated segments of nucleic acids by genetic engineering 
techniques. 

The terms "recombinant constnjct", "expression construct", "chimeric 
construct", "construct", and "recombinant DNA construct" are used interchangeably 
herein. A recombinant construct comprises an artificial combination of nucleic acid 
fragments, e.g., regulatory and coding sequences that are not found together in 
nature. For example, a chimeric construct may comprise regulatory sequences and 
coding sequences that are derived from different sources, or regulatory sequences 
and coding sequences derived from the same source, but arranged in a manner 
different than that found in nature. Such construct may be used by itself or may be 
used in conjunction with a vector. If a vector is used then the choice of vector is 
dependent upon the method that will be used to transform host cells as is well 
known to those skilled in the art. For example, a plasmid vector can be used. The 
skilled artisan is well aware of the genetic elements that must be present on the 
vector in order to successfully transform, select and propagate host cells comprising 
any of the isolated nucleic acid fragments of the invention. The skilled artisan will 
also recognize that different independent transformation events will result in different 
levels and patterns of expression (Jones et al., (1985) EMBO J, 4:241 1-241 8; 
De Almeida et al., (1989) Mol Gen. Genetics 218:78-86), and thus that multiple 
events must be screened in order to obtain lines displaying the desired expression 
level and pattern. Such screening may be accomplished by Southern analysis of 
DNA. Northern analysis of mRNA expression, immunoblotting analysis of protein 
expression, or phenotypic analysis, among others. 

The term "expression", as used herein, refers to the production of a functional 
end-product e.g., a mRNA or a protein (precursor or mature). 

The term "expression cassette" as used herein, refers to a discrete nucleic 
acid fragment into which a nucleic acid sequence or fragment can be moved. 

"Mature" protein refers to a post-translationally processed polypeptide; i.e., 
one from which any pre- or propeptides present in the primary translation product 
have been removed. "Precursor" protein refers to the primary product of translation 
of mRNA; i.e., with pre- and propeptides still present. Pre- and propeptides may be 
but are not limited to intracellular localization signals. 

"Stable transformation" refers to the transfer of a nucleic acid fragment into a 
genome of a host organism, including both nuclear and organellar genomes, 
resulting in genetically stable inheritance. In contrast, "transient transformation" 
refers to the transfer of a nucleic acid fragment Into the nucleus, or DNA-containing 

15 
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organelle, of a host organism resulting in gene expression without integration or 
stable inheritance. Host organisms containing the transformed nucleic acid 
fragments are referred to as "transgenic" organisms. 

"Antisense inhibition" refers to the production of antisense RNA transcripts 
capable of suppressing the expression of the target protein. "Co-suppression" 
refers to the production of sense RNA transcripts capable of suppressing the 
expression of identical or substantially similar foreign or endogenous genes 
(U.S. Patent No. 5,231,020). Co-suppression constructs in plants previously have 
been designed by focusing on overexpression of a nucleic acid sequence having 
homology to an endogenous mRNA, in the sense orientation, which results In the 
reduction of all RNA having homology to the overexpressed sequence (see 
Vaucheret et al. (1998) Plant J. 16:651-659; and Gura (2000) Nature 404:804-808). 
The overall efficiency of this phenomenon is low, and the extent of the RNA 
reduction is widely variable. Recent work has described the use of "hairpin" 
structures that incorporate all, or part, of an mRNA encoding sequence in a 
complementary orientation that results in a potential "stem-loop" structure for the 
expressed RNA (PCT Publication WO 99/53050 published on October 21, 1999 and 
more recently. Applicants' assignee's PCT Application having international 
publication number WO 02/00904 published on January 3, 2002). This increases 
the frequency of co-suppression in the recovered transgenic plants. Another 
variation describes the use of plant viral sequences to direct the suppression, or 
"silencing", of proximal mRNA encoding sequences (PCT Publication WO 98/36083 
published on August 20. 1998). Both of these co-suppressing phenomena have not 
been elucidated mechanistically, although genetic evidence has begun to unravel 
this complex situation (Elmayan et al. (1998) Plant Ce// 10:174^-1757). 

The polynucleotide sequences used for suppression do not necessarily have 
to be 100% complementary to the polynucleotide sequences found in the gene to be 
suppressed. For example, suppression of all the subunits of the soybean seed 
storage protein R-conglycinin has been accomplished using a polynucleotide 
derived from a portion of the gene encoding the a subunit (U.S. Patent 
No. 6,362,399). B-conglyclnIn Is a heterogeneous glycoprotein composed of varying 
combinations of three highly negatively charged subunits identified as a. a" and p. 
The polynucleotide sequences encoding the a and a' subunits are 85% identical to 
each other while the polynucleotide sequences encoding the p subunit are 75 to 
80% identical to the a and a' subunits. Thus, polynucleotides that are at least 75% 
identical to a region of the polynucleotide that is target for suppression have been 
shown to be effective in suppressing the desired target. The polynucleotide should 
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be at least 80% identical, preferably at least 90% identical, nnost preferably at least 
95% identical, or the polynucleotide may be 100% identical to the desired target. 

The present invention concerns an oilseed plant that produces mature seeds 
in which the total seed fatty acid profile comprises at least 1 .0 % of at least one 
5 polyunsaturated fatty acid having at least twenty carbon atoms and five or more • 
carbon-carbon double bonds. 

In a second embodiment, this invention concerns an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
5.0 % of at least one polyunsaturated fatty acid having at least twenty carbon atoms 

10 and five or more carbon-carbon doubly bonds. 

In a third embodiment, this invention concerns an oilseed plant that produces 
mature seeds In which the total seed fatty acid profile comprises at least 10.0 % of 
at least one polyunsaturated fatty acid having at least twenty carbon atoms and five 
or more carbon-carbon double bonds. 

15 Additional embodiments of this invention include an oilseed plant that 

produces mature seeds in which the total seed fatty acid profile comprises at least 
15.0 %, 20 %, 25 %. 30 %, 40 %. 50 %, or 60 % of at least one polyunsaturated 
fatty acid having at least twenty carbon atoms and five or more carbon-carbon 
double bonds. Indeed, one might expect that any integer level of accumulation of at 

2X3 least one polyunsaturated fatty acid from about 1 % to about 60 % of the total seed 
fatty acid profile could be obtained. 

In a fourth embodiment, this invention concerns an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises at least 
10.0 % of at least one polyunsaturated fatty acid having at least twenty carbon 

25 atoms and five or more carbon-carbon double bonds and less than 2.0% 
arachidonic acid. 

Again additional embodiments would include an oilseed plant that produces 
mature seeds in which the total seed fatty acid profile comprises at least 15.0 %, 
20 %, 25 %, 30 %, 40 %, 50 %, or 60 % of at least one polyunsaturated fatty acid 
30 having at least twenty carbon atoms and five or more^ carbon-carbon double bonds 
and less than 2.0% arachidonic acid. Indeed, one might expect that any integer level 
of accumulation of at least one polyunsaturated fatty acid from about 1 % to about 
60 % of the total seed fatty acid profile could be obtained while accumulating less 
than 2 % arachidonic acid. 
35 Examples of oilseed plants include, but are not limited to, soybean, Brassica 

species, sunflower, maize, cotton, flax, and safflower. 

Examples of polyunsaturated fatty acids having at least twenty carbon atoms 
and five or more carbon-carbon double bonds include, but are not limited to, omega- 
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3 fatty acids such as EPA, DPA and DHA. Seeds obtained from such plants are 
also within the scope of this invention as well as oil obtained from such seeds. 

In a fifth embodiment this invention concerns a recombinant construct for 
altering the total fatty acid profile of mature seeds of an oilseed plant, said construct 
5 comprising at least two promoters wherein each promoter is operably linked to a 
nucleic acid sequence encoding a polypeptide required for making at least one 
polyunsaturated fatty acid having at least twenty carbon atoms and four or more 
carbon-carbon double bonds and further wherein the total fatty acid profile 
comprises at least 2 % of at least one polyunsaturated fatty acid having at least 

10 twenty carbon atoms and four or more carbon-carbon double bonds and further 
wherein said polypeptide is an enzyme selected from the group consisting of a 
A 4 desaturase, a A 5 desaturase, A 6 desaturase, a A 15 desaturase, a 
A 17 desaturase, a CI 8 to C22 elongase and a C20 to C24 elongase. 

Such desaturases are discussed in U.S. Patent Nos. 6,075,183, 5,968,809, 

15 6,136.574, 5,972,664. 6,051,754. 6,410,288 and WO 98/46763, WO 98/46764. 
WO 00/12720. WO 00/40705 

The choice of combination of cassettes used depends in part on the PUFA 
profile and/or desaturase profile of the oilseed plant cells to be transformed and the 
LC-PUFA which is to be expressed. 

20 A number of enzymes are involved in PUFA biosynthesis. Linoleic acid (LA, 

18:2 A9,12) is produced from oleic acid (18:1 A9) by a delta-12 desaturase. GLA 
(18:3 A 6, 9,12) is produced from linoleic acid (18:2 A9.12 ) by a delta-6 desaturase. 
ARA(20:4 A 5, 8, 1 1, 14 ) production from dihomo-gamma-linolenic acid (DGLA 20:3 
A 8, 1 1, 14 ) is catalyzed by a delta-5 desaturase. However, animals cannot 

26 desaturate beyond the delta-9 position and therefore cannot convert oleic acid (18:1 
A9) into-linoleic acid (LA, 18:2 A9,12). Likewise, alpha-linolenic acid (ALA 18:3 A 9, 
12, 15) cannot be synthesized by mammals. Other eukaryotes, including fungi and 
plants, have enzymes which desaturate at positions delta-12 and delta-5. The major 
poly-unsaturated fatty acids of animals therefore are either derived from diet and/or 

30 from desaturation and elongation of linoleic acid (LA, 18:2 A9,12) or alpha-linolenic 
acid (ALA 18:3 A 9, 12, 15). 

The elongation process in plants involves a four-step process initiated by the 
crucial step of condensation of malonate and a fatty acid with release of a carbon 
dioxide molecule. The substrates in fatty acid elongation are CoA thioesters. The 

35 condensation step is mediated by a 3-ketoacyl synthase, which is generally rate 

limiting to the overall cycle of four reactions and provides some substrate specificity. 
The product of one elongation cycle regenerates a fatty acid that has been extended 
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by two carbon atoms (Browse et a!., Trends in Biochemical Sciences 27(9): 467-473 
(September 2002); Napier, Trends in Plant Sciences 7(2): 51-54 (February 2002)). 

As was noted above, a promoter is a DNA sequence that directs cellular 
machinery of a plant to produce RNA from the contiguous coding sequence 
5 downstream (3') of the promoter. The promoter region influences the rate, 

developmental stage, and cell type in which the RNA transcript of the gene is made. 
The RNA transcript is processed to produce messenger RNA (mRNA) which serves 
as a template for translation of the RNA sequence into the amino acid sequence of 
the encoded polypeptide. The 5' non-translated leader sequence is a region of the 

10 mRNA upstream of the protein coding region that may play a role in initiation and 
translation of the mRNA. The 3' transcription termination/polyadenylation signal is a 
non-translated region downstream of the protein coding region that functions in the 
plant cells to cause termination of the RNA transcript and the addition of 
polyadenylate nucleotides to the 3' end of the RNA. 

1 5 The origin of the promoter chosen to drive expression of the coding sequence 

is not important as long as it has sufficient transcriptional activity to accomplish the 
invention by expressing translatable mRNA for the desired nucleic acid fragments in 
the desired host tissue at the right time. Either heterologous or non-heterologous 
(i.e., endogenous) promoters can be used to practice the invention. 

20 Suitable promoters which can be used to practice the invention include, but 

are hot limited to, the alpha prime subunit of beta conglycinin promoter, Kunitz 
trypsin inhibitor 3 promoter, annexin promoter, Glyl promoter, beta subunit of beta 
conglycinin promoter, P34/Gly Bd m 30K promoter, albumin promoter, Leg A1 
promoter and Leg A2 promoter. The level of activity of the annexin, or P34, 

25 promoter is comparable to that of many known strong promoters, such as the CaMV 
35S promoter (Atanassova et al., (1998) Plant MoL BioL 37:275-285; Battraw and 
Hall, (1990) Plant Mol. Biol. 15:527-538; Holtorf et al., (1996) Plant MoL Biol. 
29:637-646; Jefferson et al., (1987) EMBO J. 6:3901-3907; Wilmink et al., (1995) 
Plant MoL BioL 28:949-955), the Arabidopsis oleosin promoters (Plant et al., (1994) 

30 Plant MoL BioL 25:193-205; Li, (1997) Texas A&M University Ph.D. dissertation, 
pp. 107-128), the Arabidopsis ubiquitin extension protein promoters (Callis et al.. 
1990). a tomato ubiquitin gene promoter (Rollfinke et al.. 1998). a soybean heat 
shock protein promoter (SchoffI et al., 1989). and a maize H3 histone gene promoter 
(Atanassova et al., 1998). 

35 Expression of chimeric genes in most plant cell makes the annexin, or P34, 

promoter, which constitutes the subject matter of Applicants' Assignee's copending 
application having Application No. 60/446,833 and Attorney Docket No. BB-1531 
which is filed concurrently herev^th, especially useful when seed specific expression 
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of a target heterologous nucleic acid fragment is required. Anotlier useful feature of 
tlie annexin promoter is its expression profile in developing seeds. The annexin 
promoter of the invention is most active in developing seeds at early stages (before 
10 days after pollination) and is largely quiescent in later stages. The expression 
5 profile of the annexin promoter is different from that of many seed-specific 

promoters, e.g., seed storage protein promoters, which often provide highest activity 
in later stages of development (Chen et al., (1989) Dev. Genet 10:1 12-122; 
Ellerstrom et al.. (1996) Plant MoL BioL 32:1019-1027; Keddie et al., (1994) Plant 
MoL BioL 24:327-340; Plant et al., (1994) Plant MoL BioL 25:193-205; LI, (1997) 

10 Texas A&M University Ph.D. dissertation, pp. 107-128). The P34 promoter has a 
more conventional expression profile but remains distinct from other known seed 
specific promoters. Thus, the annexin, or P34, promoter will be a very attractive 
candidate when overexpression, or suppression, of a gene in embryos is desired at 
an early developing stage. For example, it may be desirable to overexpress a gene 

15 regulating early embryo development or a gene involved in the metabolism prior to 
seed maturation. 

The promoter is then operably linked in a sense orientation using 
conventional means well known to those skilled in the art. 

Once the recombinant construct has been made, it may then be introduced 

20 into the oilseed plant cell of choice by methods well known to those of ordinary skill 
in the art including, for example, transfection, transformation and electroporation as 
described above. The transformed plant cell is then cultured and regenerated under 
suitable conditions permitting expression of the LC-PUFA which is then recovered 
and purified. 

25 The recombinant constructs of the invention may be introduced into one plant 

cell or, alternatively, each construct may be introduced into separate plant cells. 

Expression in a plant cell may be accomplished in a transient or stable 
fashion as is described above. 

The desired LC-PUFAs can be expressed in seed. Also within the scope of 
30 this invention are seeds or plant parts obtained from such transformed plants. 

Plant parts include differentiated and undifferentiated tissues, including but 
not limited to, roots, stems, shoots, leaves, pollen, seeds, tumor tissue, and various 
forms of cells and culture such as single cells, protoplasts, embryos, and callus 
tissue. The plant tissue may be in plant or in organ, tissue or cell culture. 
35 Methods for transforming dicots, primarily by use of Agrobacterium 

tumefaciens, and obtaining transgenic plants have been published, among others, 
for cotton (U.S. Patent No. 5,004,863, U.S. Patent No. 5,159,135); soybean 
(U.S. Patent No. 5,569,834, U.S. Patent No. 5,416,011); Brassica (U.S. Patent 
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No. 5,463,174); peanut (Cheng et al. (1996) Plant Cell Rep. 15:653-657, McKently 
et al. (1995) Plant Cell Rep. 14:699-703);. papaya (Ling, K. et al. (1991) 
Bio/technology 9:752-758); and pea (Grant et al. (1995) Plant Cell Rep. 
15:254-258). For a review of other commonly used methods of plant transformation 
see Newell. CJK. (2000) MoL BiotechnoL 16:53-65. One of these methods of 
transformation uses Agrobacterium rhizogenes (Tepfler, M. and Casse-Delbart, F. 

(1987) Microbiol. Sci. 4:24-28). Transformation of soybeans using direct delivery of 
DNA has been published using PEG fusion (PCT publication WO 92/17598), 
electroporation (Chowrira, G.M. et al. (1995) MoL BiotechnoL 3:17-23; Christou, P. 
etal. (1987) Proc. NatL Acad. ScL (7. A 84:3962-3966), microinjection, or particle 
bombardment (McCabe, D.E. et. al. (1988) Bio/Technology 6:923\ Christou et al. 

(1 988) Plant PhysioL 87:671 -674). 

There are a variety of methods for the regeneration of plants from plant 
tissue. The particular method of regeneration will depend on the starting plant tissue 
and the particular plant species to be regenerated. The regeneration, development 
and cultivation of plants from single plant protoplast transformants or from various 
transformed explants is well known in the art (Weissbach and Weissbach, (1988) 
In.: Methods for Plant Molecular Biology, (Eds.), Academic Press, Inc., San Diego, 
CA). This regeneration and growth process typically includes the steps of selection 
of transformed cells, culturing those individualized cells through the usual stages of 
embryonic development through the rooted plantlet stage. Transgenic embryos and 
seeds are similarly regenerated. The resulting transgenic rooted shoots are 
thereafter planted in an appropriate plant growth medium such as soil. Preferably, 
the regenerated plants are self-pollinated to provide homozygous transgenic plants. 
Othenwise, pollen obtained from the regenerated plants is crossed to seed-grown 
plants of agronomically important lines. Conversely, pollen from plants of these 
-^important-lines is'used to pollinate regenerated-plants. A transgenic plant of the 
present invention containing a desired polypeptide is cultivated using methods well 
known to one skilled in the art. 

In addition to the above discussed procedures, practitioners are familiar with 
the standard resource materials which describe specific conditions and procedures 
for the construction, manipulation and isolation of macromolecules (e.g., DNA 
rnolecules, plasmids, etc.), generation of recombinant DNA fragments and 
recombinant expression constructs and the screening and isolating of clones, (see 
for example, Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, Cold 
Spring Hartjor Press; Maliga et al. (1995) Methods in Plant Molecular Biology. Cold 
Spring Harbor Press; Birren et al. (1998) Genome Analysis: Detecting Genes, 1, 
Cold Spring Harbor, New York; Birren et al. (1998) Genome Analysis: Analyzing 
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DNA. 2. Cold Spring Harbor. New York; Plant Molecular Biology: A Laboratory 
Manual, eds. Clark, Springer, New York (1997)). 

In another aspect, this invention concerns a method for making an oilseed 
plant having an altered fatty acid profile which comprises: 
5 a) transforming a plant with the recombinant construct of the invention; 

b) growing the transformed plant of step (a); and 

c) selecting those plants wherein the total fatty acid profile comprises at 
least 1.0 % of at least one polyunsaturated fatty acid having at least twenty carbon 
atoms and five or more carbon-<;arbon double bonds. 

10 Methods of isolating seed oils are well known in the art: (Young et al, 

Processing of Fats and Oils, in 'The Lipid Handbook" (Gunstone et al eds.) 
Chapter 5 pp 253-257; London, Chapman & Hall, 1994). 

The altered seed oils can then be added to nutritional compositions such as a 
nutritional supplement, food products, infant formula, animal feed, pet food and the 

15 like. 

Compared to other vegetable oils, the oils of the invention are believed to 
function similarly to other oils in food applications from a physical standpoint. 
Partially hydrogenated oils, such as soybean oil, are widely used as ingredients for 
soft spreads, margarine and shortenings for baking and frying. 

20 Examples of food products or food analogs into which altered seed oils or 

altered seeds of the invention may be incorporated include a meat product such as 
a processed meat product, a cereal food product, a snack food product, a baked 
goods product, a fried food product, a health food product, an infant formula, a 
beverage, a nutritional supplement, a dairy product, a pet food product, animal feed 

25 or an aquaculture food product. Food analogs can be made use processes well 

known to those skilled in the art. U.S. Patent Nos. 6,355,296 B1 and 6,187,367 B1 
describe emulsified meat analogs and emulsified meat extenders. U.S. Patent 
No. 5.206,050 B1 describes soy protein curd useful for cooked food analogs (also 
can be used as a process to form a curd useful to make food analogs). U.S. Patent 

30 No. 4,284,656 to Hwa describes a soy protein curd useful for food analogs. 
U.S. Patent No. 3,988,485 to Hibbert et al, describes a meat-like protein food 
formed from spun vegetable protein fibers. U.S. Patent No. 3,950,564 to Puski et al. 
describes a process of making a soy based meat substitute and U.S. Patent 
No. 3,925,566 to Reinhart et al. describes a simulated meat product. For example, 

35 soy protein that has been processed to impart a structure, chunk or fiber for use as 
a food ingredient is called "textured soy protein" (TSP). TSPs are frequently made 
to resemble meat, seafood, or poultry in structure and appearance when hydrated. 
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There can be mentioned meat analogs, cheese analogs, milk analogs and 
the like. 

Meat analogs made from soybeans contain soy protein or tofu and other 
ingredients mixed together to simulate various kinds of meats. These meat 
5 alternatives are sold as frozen, canned or dried foods. Usually, they can be used 
the same way as the foods they replace. Meat alternatives made from soybeans 
are excellent sources of protein, iron and B vitaniins. Examples of meat analogs 
include, but are not limited to, ham analogs, sausage analogs, bacon analogs, and 
the like. 

10 Food analogs can be classified as imitiation or substitutes depending on their 

functional and compositional characteristics. For example, an imitation cheese 
need only resemble the cheese it is designed to replace. However, a product can 
generally be called a substitute cheese only if it Is nutritionally equivalent to the 
cheese it is replacing and meets the minimum compositional requirements for that 

15 cheese. Thus, substitute cheese will often have higher protein levels than imitation 
cheeses and be fortified with vitamins and minerals. 

Milk analogs or nondairy food products include, but are not limited to. 
imitation milk, nondairy frozen desserts such as those made from soybeans and/or 
soy protein products. 

20 Meat products encompass a broad variety of products. In the United States 

"meat" includes "red meats" produced fronh cattle, hogs and sheep. In addition to 
the red meats there are poultry items which include chickens, turkeys, geese, 
guineas, ducks and the fish and shellfish. There is a wide assortment of seasoned 
and processes meat products: fresh, cured and fried, and cured and cooked. 

25 Sausages and hot dogs are examples of processed meat products. Thus, the term 
"meat products" as used herein includes, but is not limited to, processed meat 
products. 

A cereal food product is a food product derived from the processing of a 
cereal grain. A cereal grain includes any plant from the grass family that yields an 
30 edible grain (seed). The most popular grains are barley, corn, millet, oats, quinoa, 
rice, rye, sorghum, triticale, wheat and wild rice. Examples of a cereal food product 
include, but are not limited to. whole grain, crushed grain, grits, flour, bran, germ, 
breakfast cereals, extruded foods, pastas, and the like. 

A baked goods product comprises any of the cereal food products mentioned 
35 above and has been baked or processed in a manner comparable to baking, i.e., to 
dry or harden by subjecting to heat. Examples of a baked good product include, but 
are not limited to bread, cakes, doughnuts, bread crumbs, baked snacks, mini- 
biscuits, mini-crackers, mini-cookies, and mini-pretzels. As was mentioned above, 
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oils of the invention cap be used as an ingredient. 

In general, soybean oil is produced using a series of steps involving the 
extraction and purification of an edible oil product from the oil bearing seed. 
Soybean oils and soybean byproducts are produced using the generalized steps 
shown in the diagram below. 
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Process Impurities Removed/ 

Byproducts Obtained 

Soybean 

t 

Oil Extraction ► Meal 

Degumming Lecithin 

Alkali or Physical Refining ► Gums, Free Fatty Acids, Pigments 

Water Washing Soap 

t 

Bleaching ► Color, Soap, Metal 

t 

(Hydrogenation) 

t 

(Winterization) ^«*^ Stearine 

t 

Deodorization FFA, Tocopherols, Sterols, Volatiles 

t 

Oil Products 



Soybean seeds are cleaned, tempered, dehulled, and flaked which increases 
the efficiency of oil extraction. Oil extraction is usually accomplished by solvent 
5 (hexane) extraction but can also be achieved by a combination of physical pressure 
and/or solvent extraction. The resulting oil is called crude oil. The crude oil may be 
degummed by hydrating phospholipids and other polar and neutral lipid complexes 
that facilitate their separation from the nonhydrating, triglyceride fraction (soybean 
oil). The resulting lecithin gums may be further processed to make commercially 
10 important lecithin products used in a variety of food and industrial products as 
emulsification and release (antisticking) agents. Degummed oil may be further 
refined for the removal of impurities; primarily free fatty acids, pigments, and 
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residual gums. Refining is accomplished by tlie addition of a caustic agent that 
reacts with free fatty acid to form soap and hydrates phosphatides and proteins in 
the crude oil. Water is used to wash out traces of soap formed during refining. The 
soapstock byproduct may be used directly in animal feeds or acidulated to recover 
5 the free fatty acids. Color is removed through adsorption with a bleaching earth that 
removes most of the chlorophyll and carotenoid compounds. The refined oil can be 
hydrogenated resulting in fats with various melting properties and textures. 
Winterization (fractionation) may be used to remove stearine from the hydrogenated 
oil through crystallization under carefully controlled cooling conditions. 

10 Deodorlzation which is principally steam distillation under vacuum, is the last step 
and is designed to remove compounds which impart odor or flavor to the oil. Other 
valuable byproducts such as tocopherols and sterols may be removed during the 
deodorization process. Deodorized distillate containing these byproducts may be 
sold for production of natural vitamin E and other high-value pharmaceutical 

15 products. Refined, bleached, (hydrogenated, fractionated) and deodorized oils and 
fats may be packaged and sold directly or further processed into more specialized 
products. A more detailed reference to soybean seed processing, soybean oil 
production and byproduct utilization can be found in Erickson, 1995, Practical 
Handbook of Soybean Processing and Utilization, The American Oil Chemists' 

20 Society and United Soybean Board. 

Soybean oil lis liquid at room temperature because it is relatively low in 
saturated fatty acids when compared with oils such as coconut, palm, palm kernel 
and cocoa butter. Many processed fats, Including spreads, confectionary fats, hard 
butters, margarines, baking shortenings, etc., require varying degrees of solidity at 

25 room temperature and can only be produced from soybean oil through alteration of 
its physical properties. This is most commonly achieved through catalytic 
hydrogenation. 

Hydrogenation is a chemical reaction in which hydrogen is added to the 
unsaturated fatty acid double bonds with the aid of a catalyst such as nickel. High 
30 oleic soybean oil contains unsaturated oleic, linoleic, |and linolenic fatty acids and 
each of these can be hydrogenated. Hydrogenation has two primary effects. First, 
the oxidative stability of the oil is increased as a result of the reduction of the 
unsaturated fatty acid content. Second, the physical properties of the oil are 
changed because the fatty acid modifications increase the melting point resulting in 
35 a semi-liquid or solid fat at room temperature. 

There are many variables which affect the hydrogenation reaction which in 
turn alter the composition of the final product. Operating conditions including 
pressure, temperature, catalyst type and concentration, agitation and reactor design 
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are among the more important parameters which can be controlled. Selective 
hydrogenation conditions can be used to hydrogenate the more unsaturated fatty 
acids In preference to the less unsaturated ones. Very light or brush hydrogenation 
is often employed to increase stability of liquid oils. Further hydrogenation converts 
5 a liquid oil to a physically solid fat. The degree of hydrogenation depends on the 
desired performance and melting characteristics designed for the particular end 
product. Liquid shortenings, used in the manufacture of baking products, solid fats 
and shortenings used for commercial frying and roasting operations, and base 
stocks for margarine manufacture are among the myriad of possible oil and fat 
10 products achieved through hydrogenation. A more detailed description of 

hydrogenation and hydrogenated products can be found in Patterson, H. B. W., 
1994. Hydrogenation of Fats and Oils: Theory and Practice. The American Oil 
Chemists* Society. 

Hydrogenated oils have also become controversial due to the presence of 
15 trans fatty acid isomers that result from the hydrogenation process. Ingestion of 
large amounts of trans isomers has been linked with detrimental health effects 
including increased ratios of low density to high density lipoproteins in the blood 
plasma and increased risk of coronary heart disease. 

A snack food product comprises any of the above or below described food 
20 products. 

A fried food product comprises any of the above or below described food 
products that has been fried. 

A health food product is any food product that imparts a health benefit. IVIany 
oilseed-derived food products may be considered as health foods. 
25 The beverage can be in a liquid or in a dry powdered form. 

For example, there can be mentioned non-carbonated drinks; fruit juices, 
fresh, frozen, canned or concentrate; flavored or plain milk drinks, etc. Adult and 
infant nutritional formulas are well known in the art and commercially available (e.g., 
Similac®, Ensure®, Jevity®, and Alimentum® from Ross Products Division, Abbott 
30 Laboratories). 

Infant formulas are liquids or reconstituted povvders fed to infants and young 
children. They serve as substitutes for human milk. Infant formulas have a special 
role to play in the diets of infants because they are often the only source of nutrients 
for infants. Although breast-feeding is still the best nourishment for infants, infant 
35 formula is a close enough second that babies not only survive but thrive. Infant 
formula is becoming more and more increasingly close to breast milk. 

A dairy product is a product derived from milk. A milk analog or nondairy 
product is derived from a source other than milk, for example, soymilk as was 
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discussed above. The.se products include; but are not limited to, whole milk, skim 
milk, fermented milk products such as yogurt or sour milk, cream, butter, condensed 
milk, dehydrated milk, coffee whitener, coffee creamer, ice cream, cheese, etc. 

A pet food product is a product intended to be fed to a pet such as a dog. cat. 
5 bird, reptile, fish, rodent and the like. These products can include the cereal and 
health food products above, as well as meat and meat byproducts, soy protein 
products, grass and hay products, including but not limited to alfalfa, timothy, oat or 
brome grass, vegetables and the like. 

Animal feed is a product intended to be fed to animals such as turkeys, 
10 chickens, cattle and swine and the like. As with the pet foods above, these products 
can include cereal and health food products, soy protein products, meat and meat 
byproducts, and grass and hay products as listed above. 

Aqualculture feed is a product intended to be used in aquafarming which 
concerns the propagation, cultivation or farming of aquatic organisms, animals 
1 5 and/or plants in fresh or marine waters. 

In yet another embodiment, this invention includes an oilseed plant that 
produces mature seeds in which the total seed fatty acid profile comprises 
polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of EPA:DHA is in the range from 
20 1:100 to 860:100. The oilseed plant may further have a total seed fatty acid profile 
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 

In still yet another embodiment, this invention includes an oilseed plant th^t 
produces mature seeds in which the total seed fatty acid profile comprises 
25 polyunsaturated fatty acids having at least twenty carbon atoms and five or more 
carbon-carbon double bonds wherein the ratio of DHA:EPA is in the range from 
1:100 to 110:100. The oilseed plant may further have a total seed fatty acid profile 
comprising less than 2.0% arachidonic acid. Also of interest are seeds obtained 
from such plants and oil obtained from the seeds of such plants. 
30 It is reasonable to believe that any integer ratio of EPAiDHA from 1 : 1 00 

through 860:100, or DHA:EPAfrom 1:100 through 110:100, might be obtainable in 
plants described or envisioned within the scope and spirit of the present invention. 

EXAMPLES 

The present invention is further defined in the following Examples, in which 
35 all parts and percentages are given as weight to volume, and degrees are Celsius, 
unless othenwise stated. It should be understood that these Examples, while 
indicating preferred embodiments of the invention, are given by way of illustration 
only. From the above discussion and these Examples, one skilled in the art can 
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ascertain the essential characteristics of this inverition, and without departing from 
the spirit and scope thereof, can make various changes and modifications of the 
invention to adapt it to various uses and conditions. Thus, various modifications of 
the invention in addition to those shown and described herein will be apparent to 
5 those sl<illed in the art from the foregoing description. Such modifications are also 
intended to fall within the scope of the appended claims. 

The disclosures contained within the references used herein are hereby 
incorporated by reference. 
GENERAL MATERIALS AND METHODS 

10 Procedures for nucleic acid phosphorylation, restriction enzyme digests, 

ligation and transformation are well known in the art. Techniques suitable for use in 
the following examples may be found in Sambrook, J., Fritsch, E. F. and Maniatis, 
T., Molecular Cloning: A Laboratory Manual . Second Edition, Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY (1989) (hereinafter "Maniatis"). 

15 Materials and methods suitable for the maintenance and growth of bacterial 

cultures are well known in the art. Techniques suitable for use in the following 
examples may be found as set out in Manual of Methods for General Bacterioloov 
(Phillipp Gerhardt, R. G. E. Murray, Ralph N. Costilow, Eugene W. Nester, Willis A. 
Wood, Noel R. Krieg and G. Briggs Phillips, eds), American Society for 

2t) Microbiology. Washington, DC. (1994)) or by Thomas D. Brock in Biotechnology: A 
. Textbook of Industrial Microbiology . Second Edition, Sinauer Associates, Inc., 
Sunderiand, MA (1989). All reagents, restriction enzymes and materials used for 
the growth and maintenance of bacterial and plant cells were obtained from Aldrich 
Chemicals (Milwaukee, Wl). DIFCO Laboratories (Detroit. Ml). GIBCO/BRL 

25 (Galthersburg. MD), or Sigma Chemical Company (St. Louis, MO) unless otherwise 
specified. 

The meaning~of'abbreviations is~as~follows: "h""or **hr" means hour(s), "min" 
or "min." means minute(s), "sec" or "s" means second(s), "d*' or "day" means day(s), 
"mL" means milliliters, "L" means liters. 

30 Bacterial Strains and Plasmids: 

E. co// TOP10 cells and E. co// electromax DH10B cells were obtained from 
Inyitrogen (Carlsbad, CA). Max Efficiency competent cells of E. coli DH5a were 
obtained from GIBCO/BRL (Galthersburg, MD). Plasmids containing EPA or DHA 
biosynthetic pathway genes were obtained from Ross Products Division, Abbott 

35 Laboratories, Columbus OH. The genes and the source plasmids are listed in 
Table 1. 
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TABLE 1 

EPA BIOSYNTHETIC PATHWAY GENES 



Gene 


... ,., — , 
Organism 


1 

Plasmid Name 


Reference 


Delta-6 desaturase 


S. diclina 


pRSP1 


WO 02/081668 


Delta-6 desaturase 


M. alpina 


pCGR5 


US Patent 5,968,809 


Elongase 


M. alpina 


pRPB2 


WO 00/12720 


Delta-5 desaturase 


M. alpina 


pCGR4 


US Patent 6,075,183 


Delta-5 desaturase 


S. diclina 


pRSP3 


WO 02/081668 


Delta-17 desaturase 


S. diclina 


pRSP19 


Example 6 


Elongase 


T. aureum 


pRAT-4-A7 


WO 02/08401 


Elongase 


Pavlova sp. 


PRPL-6-B2 


Example 13 


Delta-4 desaturase 


S. aggregatum 


pRSA1 


WO 02/090493 



Plasmids pKS102 and pKS121 are described in WO 02/00904. Plasmid 
5 pKS123 is described in WO 02/08269. Plasmid pCF3 is described in [Yadav, N.S. 
et al (1993) Plant Physiol. 103:467-76]. Cloning vector pCR-Scrlpt AMP SK(+) was 
from Stratagene (La Jolla, CA), Cloning vector pUC19 [Messing. J. (1983) Meth, 
EnzymoL 101:20] was from New England Biolabs (Beverly, MA). Cloning vector 
pGEM-T easy was from Promega (Madison, Wl). 
10 Growth Conditions: 

Bacterial cells were usually grown in Luria-Bertani (LB) medium containing 
1% of bacto-tryptone, 0.5% of bacto-yeast extract and 1% of NaCI. Occasionaly, 
bacterial cells were grown in SOC medium containing 2% of bactp-tryptone, 0.5% of 
bacto-yeast extract, 0.5% of NaCI and 20 mM glucose or in Superbroth (SB) 
15 containing 3,5% of bacto-tryptone, 2% of bacto-yeast extract, 0.05% of NaCI and 
0.005 M NaOH. 

Antibiotics were often added to liquid or solid media in order to select for 
plasmids or insertions with appropriate antibiotic resistance genes. Kanamycin, 
ampicillin and hygromycin were routinely used affinal concentrations of 50 pg/mL 
20 (Kan50), 100 pg/mL (AmplOO) or 50 pg/mL (Hyg50), respectively. 

EXAMPLE 1 
Isolation of soybean seed-specific promoters 
The soybean annexin and BD30 promoters were isolated with the Universal 
GenomeWalker system (Clontech) according to its user manual (PT3042-1). To 
25 make soybean GenomeWalker libraries, samples of soybean genomic DMA were 
digested with Dral, EcoRV, Pvull and Stul separately for two hours. After DNA 
purification, the digested genomic DNAs were ligated to the GenomeWalker 
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adaptors AP1 and AP2. 

Two gene specific primers (GSP1 and GSP2) were designed for soybean 
annexin gene based on the 5' coding sequences in annexin cDNA in DuPont EST 
database. The sequences of GSP1 and GSP2 are set forth in SEQ ID NOS:1 and 
5 2. 

GCCCCCCATCCTTTGAAAGCCTGT SEQ ID NO: 1 

CGCGGATCCGAGAGCCTCAGCATCTTGAGCAGAA SEQ ID NO:2 

10 

The AP1 and the GSP1 primers were used In the first round PGR using the 
conditions defined in the GenomeWalker system protocol. Cycle conditions were 
94^C for 4 minutes; 94**C for 2 second and 72^C for 3 minutes, 7 cycles; 94^C for 
2 second and 67°C for 3 minutes, 32 cycles; 67°C for 4 minutes. The products from 

15 the first run PGR were diluted 50-fold. One microliter of the diluted products were 
used as templates for the second PGR with the AP2 and GSP2 as primers. Cycle 
conditions were 94'='G for 4 minutes; 94^C for 2 second and 72*^0 for 3 min. 5 cycles; 
94°C for 2 second and 67^C for 3 minutes, 20 cycles; 67*'C for 3 minutes. A 2.1 kb 
genomic fragment was amplified and isolated from the EcoRV-digested 

2b GenomeWalker library. The genomic fragment was digested with BamH I and Sal I 
. and cloned into Bluescript KS+ vector for sequencing. The DNA sequence of this 
2012 bp soybean annexin promoter fragment is set forth in SEQ ID NO:3. 

Two gene specific primers (GSP3 and GSP4) were designed for soybean 
BD30 based on the 5' coding sequences in BD30 cDNA in NCBI GenBank 

25 (J05560). The oligonucleotide sequences of the GSP3 and GSP4 primers have the 
sequences set forth in SEQ ID NOS:4 and 5. 

GGTCCAATATGGAACGATGAGTTGATA SEQ ID NO:4 

30 CGCGGATCCGCTGGAACTAGAAGAGAGACGTAAGA SEQ ID NO:5 

The API and the GSP3 primers were used in the first round PGR using the 
same conditions defined in the GenomeWalker system protocol. The cycle 
conditions used for soybean annexin promoter do not work well for the soybean 
35 BD30 promoter in GenomeWalker experiment. A modified touchdown PGR protocol 
was used. Cycle conditions were: 94**C for 4 minutes; 94*'C for 2 second and 74*'G 
for 3 minutes. 6 cycles in which annealing temperature drops I^C every cycle; 94**C 
for 2 second and 69**C for 3 minutes, 32 cycles; Sg^'C for 4 minutes. The products 
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from the l^t run PGR were diluted 50-fold. One microliter of the diluted products 
were used as templates for the 2"^ PGR with the AP2 and GSP4 as primers. Gycle 
conditions were: 94^C for 4 minutes; 94°G for 2 second and 74°C for 3 min, 6 cycles 
in which annealing temperature drops 1°G every cycle; 94'^G for 2 second and 69"C 
for 3 minutes, 20 cycles; sg^'G for 3 minutes. A 1 .5 kb genomic fragment was 
amplified and isolated from the P\/ull-digested GenomeWaiker library. The genomic 
fragment was digested with 8amHl and Sa/I and cloned into Bluescript KS^ vector 
for sequencing. DNA sequencing determined that this genomic fragment contained 
a 1408 bp soybean BD30 promoter sequence (SEQ ID NO:6). 

Based on the sequences of the soybean p-conglycinin p-subunit promoter 
sequence in NGBl database (S44893), two oligos with either SamHI or Not\ sites at 
the 5' ends were designed to amplify the soybean p-conglycinin p-subunit promoter 
(SEQ ID NO:7). The oligonucleotide sequences of these two oligos are set forth in 
SEQ ID NOS: 8 and 9. 

CGGGGATGGTATATATGTGAGGGTAGAGGGTATGAC SEQ ID NO:8 

GAATTCGCGGCCGCAGTATATATATTATTGGACGATGAAAGATG SEQ ID NO:9 

Based on the sequences of the soybean Glycinin Gyl promoter sequence in 
the NGBl GenBank database (X15121), two oligos with either BamHI or Nott sites at 
the 5' ends were designed to amplify the soybean Glycinin Gy1 promoter (SEQ ID 
NO:10). The oligonucleotide sequences of these two oligos are set forth in SEQ ID 
NOS:11and12. 

GGGGGATGGTAGCGTAAGTACGTAGTGAAAATGGGA SEQ ID NO: 1 1 

GAATTGGCGGGGGGGGTGATGAGTGATGAGTGTTTAAGGAG SEQ ID NO: 12 

EXAMPLE 2 I 

Vector Gonstruction for Characterizing Strong. Seed-spe cific Promoters 
EPA can be produced at high levels in the seeds of important oil crops, such 
as soy, by strongly expressing each of the individual biosynthetic genes together, in 
a seed specific manner. To reduce the chance of co-suppression, each Individual 
gene can be operably linked to a different, strong, seed-specific promoter. Because 
the biosynthetic pathway leading to EPA involves the concerted action of a large 
number of different genes, it was necessary to first identify and characterize many 
different promoters that could then be used to express each EPA biosynthetic gene. 
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Promoters were identified and tested for their relative seed-speoific strengths by 
linl<lng them to the M. alpina delta-6 desaturase which, In these experiments, acted 
as a reporter gene. The M. alpina delta-6 desaturase can introduce a double bond 
between the C6 and C7 carbon atoms of linoleic acid (LA) and a-linolenic acid 
5 (ALA) to form y-linolenic acid (GLA) and stearidonic acid (STA), respectively. 
Because GLA and STA are not normally found In the lipids of soybean, their 
presence and concentration in soy was indicative of the relative strength of the 
promoter behind which the delta6 desaturase had been placed. Promoters tested in 
this way are listed in Table 2 and the plasmid construction for each Is described 
10 below. 



TABLE 2 

SEED-SPECIFIC PROMOTERS AND VECTORS 



Promoter 


Organism 


Vector Name f Promoter Reference 


P-conglycinIn a'-subunit 


Soy 


.pKR162 


Beachy et al., (1985) 
EMBO J. 4:3047-3053 


Kunitz Trypsin Inhibitor 


Soy 


pKR124 


Jofuku et a!., (1989) Plant 
Cell 1:1079-1093 


annexin 


Soy 


pJS92 


this reporti 


Glycinin Gy1 


Soy 


PZBL119 


this report 


Albumin 2S 


Soy 


pKR188 


US Patent 6,177,613 


Legumin A1 


Pea 


pKR189 


Rerieetal. (1991) Mol. 
Gen, Genet 225:148-157 


p-conglycinin p-subunit 


Soy 


ZBL118 


this report 


BD30 ( also called P34) 


Soy 


pJS93 


this report'' 


Legumin A2 


Pea 


pKR187 


Rerie et al. (1991) Mol. 
Gen. Genet 225:148-157 



TJhis also constitutes the subject matter of Applicant's Assignees's application having Application 
15 No. 60/446,833 (Attorney Docket No. BB1531PRV) filed concurrently herewith. 



The gene for the M. alpina delta-6 desaturase [was PCR-amplified from 
pCGR5 using primers oCGR5-1 (SEQ ID NO:13) and oCGR5-2 (SEQ ID NO:14), 
which were designed to introduce Not\ restriction enzyme sites at both ends of the 
20 delta-6 desaturase and an Ncol site at the start codon of the reading frame for the 
enzyme. 
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TTGCGGCCGCAAAQCATGGCTGCTGCTCCCAG (SEQ ID NO: 13) 

AAGCGGCCGCTTACTGCGCCTTAC (SEQ ID NO: 14) 

5 The resulting PGR fragment was subcloned into the intermediate cloning vector 
pGR-Script AMP SK(+) (Stratagene) according the manufacturer's protocol to give 
plasmid pKR159. Plasmid pKR159 was then digested with Not\ to release the 
M. alpina delta-6 desaturase, which was, in turn, cloned into the Not\ site of a 
selected soybean expression vector. Each expression vector tested contained a 

10 Notl site flanked by a suitable promoter and transcription terminator. Each vector 
also contained the hygromycin B phosphotransferase gene [Gritz, L. and Davies, J. 
(1983) Gene 25:179-188], flanked by the T7 promoter and transcription terminator 
(T7prom/hpt/T7term cassette), and a bacterial origin of replication (ori) for selection 
and replication in E. coli. In addition, each vector also contained the hygromycin B 

15 phosphotransferase gene, flanked by the 35S promoter [Odell et al., (1985) Nature 
313:810-812] and NOS 3' transcription terminator [Depicker et al., (1982) J. Mol. 
AppL Genet 1:561:570] (35S/hpt/NOS3' cassette) for selection in soybean. 

Vector pKR162 was constructed by cloning the Nott fragment of pKR159, 
containing the delta-6 desaturase, into the A/ofI site of vector KS123. Vector KS123 

20 contains a A/ofl site flanked by the promoter for the a' sijbunit of p-conglycinin and 
the phaseolin 3' transcription terminator elements Ocon/A/of//Phas3' cassette). 

Vector pl<R188 was constructed by cloning the Not\ fragment of pKR159, 
containing the delta-6 desaturase, into the Not\ site of vector pKR135. Vector 
pKR135 contains a Not\ site flanked by the 2S albumin promoter and the 2S 

25 albumin 3' transcription temninator elements (SA/A/of//SA3' cassette). Plasmid 
pKR135 was constructed by cloning the BamHl/Sal\ fragment of pKR132, 
containing the SA/A/of//SA3' cassette, into the BamH\/Sal\ site of pKS120. Plasmid 
pKS120 is identical to pKS123 except the Hind\\\ fragment containing the 
pcon/A/of//Phas3' cassette was removed. Plasmid pKR132, containing the 

30 SA/A/of//SA3' cassette flanked by SamHI and Sa/I sites, was constructed by cloning 
the Xba\ fragment of the SA/A/ofl/SA3' cassette, made by PGR amplification, into 
the Xba\ site of pUC1 9. The albumin promoter was amplified from plasmid AL3 
promoter::pBI121 (US Patent 6,177,613) using PGR. Primer oSAIb-9 (SEQ ID 
NO: 15) was designed to introduce an Xba\ site at the 5' end of the promoter, and 

35 oSAIb-3 (SEQ ID NO: 16) was designed to introduce a Notl site at the 3' end of the 
promoter. 
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ATCTAGACCTGCAGGCCAACTGCGTTTGGGGCTC (SEQ ID NO:1 5) 

CTTTTAACTTCGCGGCCGCTTGCTATTGATGGGTGAAGTG (SEQ ID NO: 1 6) 

5 The albumin transcription terminator was amplified from soy genomic DNA using- 
primer oSAIb-4 (SEQ ID NO:17), designed to introduce a Not\ site at the 5' end of 
the terminator, and primer oSAIb-2 (SEQ ID NO:18), designed to introduce Bsi\Nl 
and Xbal sites at the 3' end of the terminator 

10 CAATAGCAAGCGGCCGCGAAGTTAAAAGCAATGTTGTC (SEQ ID NO:17) 

AATCTAGACGTACGCAAAGGCAAAGATTTAAACTC (SEQ ID NO: 1 8) 

The resulting PGR fragments were then combined and re-amplified using primers 

15 oSAIb-9 and oSAIb-2, thus forming the SA/A/ofl/SA3' cassette, which was 
subsequently cloned into pUC19 to give pKR132. 

Vector pKR187 was constructed by cloning the Notl fragment of pKR159, 
containing the delta-6 desaturase, into the Notl site of vector pKR145. Vector 
pKR145 contains a Notl site flanked by the pea leguminA2 promoter and the pea 

20 leguminA2 3' transcription terminator (legA2/A/of//legA23' cassette). Plasmid 
pKR145 was constructed by cloning the BamHUSall fragment of pKR142. 
containing the legA2/A/of//iegA23' cassette, into the BamHUSall fragment of KS120 
(described above). The legA2/A/ofl/legA23' cassette of pKR142 was flanked by 
Ss/WI sites and contained a Pstl site at the extreme 5' end of legA2 promoter. In 

25 addition, this cassette was flanked by BamHl and Sail sites. Plasmid pKR142 was 
constructed by cloning the BsNMl fragment of pKR140, containing the 
legA2/A(o/l/legA23' cassette, into the Bs/WI site of pKR124. containing a bacterial 
on and ampicillin resistance gene. This cloning step introduced the Sa/I site and 
allowed further subcloning into pKS124. The legA2/A/ofl/legA23' cassette of 

30 pKR140 was made by PGR amplification from pea genomic DNA; The legA2 
promoter was amplified from pea genomic DNA using primer LegProS' (SEQ ID 
NO:19), designed to introduce Xbal and Bs/WI sites at the 5' end of the promoter, 
and primer LegPro3' (SEQ ID NO:20), designed to introduce a Notl site at the 3' 
end of the promoter. 

35 

TTTCTAGACGTACGTCCCTTCTTATCTTTGATCTCC (SEQ ID NO: 1 9) 



GCGGCCGCAGTTGGATAGAATATATGTTTGTGAC 

35 



(SEQ ID NO:20) 
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The legA2 transcription terminator was amplified from pea genomic DNA using 
primer LegTermS* (SEQ ID NO:21), designed to introduce A/ofI site at tlie 5' end of 
the terminator, and primer LegTemi3' (SEQ ID NO:22). designed to introduce es/WI 
5 andXbal sites at the 3' end of the terminator. 

CTATCCAACTGCGGCCGCATTTCGCACCAAATCAATGAAAG (SEQ ID NO:21) 
AATCTAGACGTACGTGAAGGTTAAACATGGTGAATATG (SEQ ID NO:22) 

10 

The resulting PGR fragments were then combined and re-amplified using primers 
LegProS' and LegTermS", thus forming the legA2/A/ofl/legA23' cassette. The 
legA2/A/ofl/legA23' cassette PGR fragment was subcloned into the intemnediate 
cloning vector pCR-Script AMP SK(+) (Stratagene) according the manufacturer's 

16 protocol to give piasmid pKR140. Plasmid pKR124 contains a A/ofl site flanked by 
the KTi promoter and the iCTi transcription termination region (KTi/A/o«/KTi3' 
cassette). In addition, the KTi/A/o«/KTi3' cassette was flanked by Bs/Wl sites. The 
KTi/A/ofl/KTi3' cassette was PCR-amplified from pKS126 using primers oKTiS (SEQ 
ID NO:23) and oKTi6 (SEQ ID NO:24), designed to introduce an Xba\ and Ss/WI 

20 site at both ends of the cassette. 

ATGTAGAGGTACGTCGTCGAAGAGAAGGG (SEQ ID NO:23) 

TTCTAGACGTACGGATATAATG (SEQ ID NO:24) 

25 

The resulting PGR fragment was subcloned into the Xba\ site of the cloning vector 
pUG19 to give plasmid pKR124. Plasmid pKS126 is similar to pKS121 
(WO 02/00904), the fomner possessing a second hygromycin phosphotransferase 
gene that is operably linked to a 35S-GaMV promoter. 

30 Vector pKRI 89 was constructed by cloning the Not\ fragment of pKRI 59, 

containing the delta-6 desaturase, into the Not\ site of vector pKR154. Vector 
pKR154 contains a Not\ site flanked by the pea leguminAI promoter and the pea 
leguminA2 3* transcription terminator (legA1/WoW/legA23' cassette). Vector pKR154 
was made by cloning the Hind\\\/Not\ fragment of pKR151, containing the legAI 3' 

35 promoter into the Hinm/Noti fragment of pKR150. Plasmid pKR1 51 contained a 
Non site flanked by the leguminAI promoter and the leguminAI 3' transcription 
terminator (legA1/A/ofl/legA1 3" cassette). In addition, the !egA1/A/ofl/legA1 3' 
cassette was flanked by 8s/WI site. The legA1/A/ofl/legA13' cassette was made by 

36 



i 

/ 

wo 2004/071467 PCTAJS2004/005758 

PCR amplification from pea genomic DNA. The legAI promoter was PCR-amplified 
using primer LegAI Pro5* (SEQ ID NO:25), designed to introduce Xba\ and 8s/WI 
sites at the 5' end of the promoter, and primer LegAI Pro3' (SEQ ID NO:26), 
designed to introduce a Not\ site at the 3' end of the promoter. 

TTTCTAGACGTACGGTCTCAATAGATTAAGAAGTTG (SEQ ID NO:25) 

GCGGCCGCGAAGAGAGATACTAAGAGAATGTTG (SEQ ID NO:26) 

The legAI transcription terminator was amplified from pea genomic DNA using 
primer LegAI Term5' (SEQ ID NO:27). which was designed to introduce Nott site at 
the 5' end of the terminator, and primer LegAI TermS' (SEQ ID NO:28). which was 
designed to Introduce Bsi\N\ and Xba\ sites at the 3' end of the terminator. 

GTATCTCTCTTCGCGGCCGCATTTGGCACCAAATCAATG (SEQ ID NO:27) 

TTTCTAGACGTACGTCAAAAAATTTCATTGTAACTC (SEQ ID NO:28) 

The resulting PCR fragments were then combined and re-amplified using primer 
LegAI Pro5' and LegAI TerniS*, thus forming the legAI /A/of l/legA1 3' cassette. The 
legA1/A/ofl/legA13' cassette PCR fragment was subcloned into the intermediate 
cloning vector pCR-Script AMP SK(+) (Stratagene) according the manufacturer's 
protocol to give plasmid pPL1A. The legA1/A/ofl/legA13' cassette was subsequently 
excised from pPL1A by digestion with Bs/WI and cloned into the Ss/WI site of 
pKR145 (described above) to give pKR161. Plasmid pKR150 was constructed by 
cloning the eamHI/H/ndlll fragment of pKR142 (described above), contiaining the 
legA2/A/ofl/legA23* cassette into the eamHI/H/ndlll site of KS120 (described above). 

The amplified soybean p-conglycinin p-subunit promoter fragment (as 
described in Example 1) was digested with BamH I and A/ofl. purified and cloned 
into the BamH I and Not\ sites of plasmid pZBL1 15 to make pZBL1 16. The 
pZBL115 plasmid contains the origin of replication from pBR322, the bacterial HPT 
hygromycin resistance gene driven by T7 promoter and T7 terminator, and a 35S 
Rromoter-HPT-Nos3' gene to serve as a hygromycin resistant plant selection 
marker. The Not I fragment of pKR159, containing the M. alpina delta 6 desaturase 
gene, was cloned into Not I site of pZBL1 16 in the sense orientation to make plant 
expression cassettes pZBL1 18. 

The amplified soybean glycinin Gyl promoter fragment (described in 
Example 1) was digested with BamH I and A/ofl. purified and cloned into the BamH I 
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and Not\ sites of plasmid pZBL1 15 to make pZBL1 17. The Not\ fragment of 
pKR159, containing the M. alpina delta-6 desaturase gene, was cloned into Not\ site 
of pZBL1 17 in the sense orientation to make plant expression cassettes pZBL1 19. 
Based on the sequence of the soybean annexin promoter (SEQ ID NO:3), as 
5 described in Example 1 , two oligos with either BamH I or A/ofl sites at the 5' ends 
were designed to re-amplify the promoter. The oligonucleotide sequences of these 
two oligos are shown in SEQ ID NO:29 and SEQ ID NO:30. 

CGCGGATCCATCTTAGGCCCTTGATTATATGGTGTTT (SEQ ID NO:29) 

10 

GAATTCGCGGCCGCTGAAGTATTGCTTCTTAGTTAACCTTTCC (SEQ ID NO:30) 

Based on the sequences of cloned soybean BD30 promoter (SEQ ID NO:6). 
as described in Example 1 , two oligos with either BamH I or Not I sites at the 5' 
15 ends were designed to re-amplify the BD30 promoter. The oligonucleotide 

sequences of these two oligos are shown in SEQ ID NO:31 and SEQ ID NO:32. 

CGCGGATCCAACTAAAAAAAGCTCTCAAATTACATTTTGAG (SEQ ID NO:31) 

20 GAATTCGCGGCCGCAACTTGGTGGAAGAATTTTAT(3ATTTGAAA (SEQ ID 
NO:32) 

The re-amplified annexin and BD30 promoter fragments were digested with 
BamH I and A/ofl, purified and cloned into the BamH I and Noli sites of plasmid 
26 pZBL1 15 to make pJS88 and pJS89, respectively. The pZBL1 15 plasmid contains 
the origin of replication from pBR322, the bacterial HPT hygromycin resistance 
gene driven by T7 promoter and T7 terminator, and a 35S promoter-HPT-Nos3' 
gene to serve as a hygromycin resistant plant selection marker. The M. alpina 
delta-6 desaturase gene was cloned into A/ofl site of pJSBB and pJS89, in the sense 
30 orientation, to make plant expression cassettes pJS92 and pJS93. respectively. 

EXAMPLE 3 

Cloning of Individual EPA Biosvnthetic Pathwav Genes for Exoression In Somatic 

Soybean Embryos 
Each of the EPA biosynthetic genes was tested individually in order to 
35 assess their activities in somatic soybean embryos before combining for large-scale 
production transformation into soybean. Each gene was cloned Into an appropriate 
expression cassette as described below. For the M alpina delta-5 desaturase and 
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elongase, both genes were combined together on one plasmid. The genes and 
promoters used, and the corresponding vector names are listed in Table 3. 

TABLE 3 



EPA BIOSYNTHETIC GENES EXPRESSED IN SOYBEAN SOMAT 


C EMBRYOS 


Activity 


Source 

Organism 


Sequence 
(DNA) 


Sequence 
(Protein) 


Vector 


Delta-6 
desaturase 


M. alpha 


SEQ ID NO:33 


SEQ ID NO:34 


pKR162 


Delta-6 
desaturase 


S. diclina 


SEQ ID NO:35 


SEQ ID NO:36 


pKS208 


Delta-5 
desaturase 


S. diclina 


SEQ ID NO:37 


SEQ ID NO:38 


pKR305 


elongase 


T. aureum 


SEQ ID NO:39 


SEQ ID NO:40 


pKS209 


Delta-17 
desaturase 


S. diclina 


SEQ ID NO:41 


SEQ ID NO:42 


pKS203 


elongase 


M. alpina 


SEQ ID NO:43 


SEQ ID NO:44 


pKS134 


Delta-5 
desaturase 


M. alpina 


SEQ ID NO:45 


SEQ ID NO:46 


pKS134 



Construction of pKR162, for soy expression studies with the M. alpina delta-6 
desaturase, was described in Example 2. 

The S. diclina delta-6 desaturase was cloned into the A/ofI site of the 
Pcon/A/of//Phas3' cassette of vector pKS123. The gene for the S. diclina delta-6 
desaturase was removed from pRSP1 by digestion with EcoRI and HindiW. The 
ends of the resulting DNA fragment were filled and the fragment was cloned into the 
filled A/of I site of pKS123 to give pKS208. 

To release the S. diclina delta-5 desaturase from plasmid pRSP3, it was first 
digested with Xhoi, the Xhol ends were filled, and the plasmid was then digested 
with EcoRI. The deita-5 desaturase-containing fragnlient was then cloned into 
pKR288 that had been digested with Mfel and EcoRV to give pKR305. Plasmid 
pKR288 was identical to pKS123 except that a linker containing the Mfel (on the 
promoter side) and EcoRV (on the 3' terminal side) sites had been Inserted into the 
Notl site of the pcon/A/of//Phas3' cassette. This allowed for directional cloning of 
the delta-5 desaturase, which contained internal Not\ sites, into pKS123. 
Construction of pKR288 is more thoroughly described In Example 1 3. 
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The r. aureum elongase was cloned into the Notl site of the 
pcon/A/of//Phas3' cassette of vector pKS123. The gene for the T. aureum elongase 
was removed from pRAT-4-A7 by digestion with EcoRI. The ends of the resulting 
DNA fragment were filled and the fragment was cloned into the fillejd Notl site of 
5 pKS123 to give pKS209. 

The gene for the S. diclina delta-17 desaturase (Example 6) was amplified 
from pRSPig using primers RSP19fonA/ard (SEQ ID NO:53) and RSP19reverse 
(SEQ ID NO:54) which were designed to introduce Not\ restriction enzyme sites at 
both ends of the delta-1 7 desaturase. 

10 

GCGGCCGCATGACTGAGGATAAGACGA (SEQ ID NO:53) 
GCGGCCGCTTAGTCCGACTTGGCCTTG (SEQ ID NO:54) 

15 The resulting PGR fragment was subcloned into the intermediate cloning vector 
pGEM-T easy (Promega) according the manufacturer's protocol to give plasmid 
pRSP19/pGEM. The gene for the S. diclina delta-17 desaturase was released from 
pRSP19/pGEM by partial digestion with Notl and cloned into the A/ofl site of . 
pKS123 to give pKS203. 

20 In plasmid pKS134, both the M. alpina elongase and M. alpina delta-5 

desaturase were cloned behind the p-conglycinin promoter followed by the 
phaseolin 3' transcription terminator O con/Maelo/Phas3' cassette, 
pcon/Mad5/Phas3* cassette). Plasmid pKS134 was constructed by cloning the 
H/ndlll fragment of pKS129, containing the pcon/Mad5/Phas3' cassette, into a 

25 Ay/ndlll site of partially digested pKS128, containing the pcon/[\/laelo/Phas3' 

cassette, the T7prom/hptyT7term cassette and the bacterial on region. The gene for 
the M alpina elongase was amplified from pRPB2 using primers RPB2foward (SEQ 
ID NO:55) and RPB2reverse (SEQ ID NO:56) which were designed to introduce 
Not\ restriction enzyme sites at both ends of the elongase. 

30 

GCGGCCGCATGGAGTCGATTGCGC (SEQ ID NO:55) 

GCGGCCGCTTACTGCAACTTCCTT (SEQ ID NO:56) 

35 The resulting PGR fragment was digested with Notl and cloned into the Nofl site of 
pKS119, containing a pcon/A/ofl/Phas3' cassette, the T7prom/hpt/T7temi cassette 
and the bacterial on region, to give pKS128. Plasmid pKS119 is identical to 
pKS123, except that the 35S/HPT/NOS3* cassette had been removed. The gene 

40 



wo 2004/071467 



PCTAJS2004/005758 



for the M. alpina delta-5 desaturase was amplified from pCGR4 using primers 
CGR4foward (SEQ ID NO:57) and CGR4reverse (SEQ ID NO:58) which were 
designed to introduce Notl restriction enzyme sites at both ends of the desaturase, 

5 GCGGCCGCATGGGAACGGACCAAG (SEQ ID NO:57) 

GCGGCCGCCTACTCTTCCTTGGGA (SEQ ID NO:58) 

The resulting PGR fragment was digested with Notl and cloned into the Nott site of 
10 pKS1 19, containing a pcon/A/ofl/Phas3' cassette flanked by H/ndlll sites, to give 
pKS129. 

EXAMPLE 4 

Assembling EPA biosvnthetic pathway genes for expression in Somatic Soybean 

15 Embryos and Soybean Seeds (pKR274) 

The M. alpina delta-6 desaturase, M. alpina elongase and M. alpina deIta-5 
desaturase were cloned into plasmid pKR274 (Figure 3) behind strong, seed- 
specific promoters allowing for high expression of these genes In somatic soybean 
embryos and soybean seeds. The delta-6 desaturase was cloned behind the 

20 promoter for the rf subunit of p-conglycinin [Beachy et al., (1 985) EMBO J. 

4:3047-3053] followed by the 3' transcription termination region of the phaseolin 
gene [Doyle, J.J. et al. (1986) J. BioL Cheni. 261:9228-9238] (pcon/Mad6/Phas3' 
cassette). The delta-5 desaturase was cloned behind the Kunitz soybean Trypsin 
Inhibitor (KTi) promoter [Jofuku et al., (1989) Plant Cell 1:1079-1093], followed by 

25 the KTi 3' termination region, the isolation of which is described in US Patent 

6,372,965 (KTi/Mad5/KTi3* cassette). The elongase was cloned behind the glyclnin 
Gyl promoter followed by the pea leguminA2 3' termination region 
(Gy1/Maelo/legA2 cassette). All of these promoters exhibit strong tissue specific 
expression in the seeds of soybean. Plasmid pKR274 also contains the hygromycin 

30 B phosphotransferase gene [Gritz, L. and Davies, J. (1983) Gene 26:179-188] 

cloned behind the T7 RNA polymerase promoter and followed by the T7 terminator 
(T7prom/HPTrr7term cassette) for selection of the plasmid on hygromycin B in 
certain strains of E. coli, such as NovaBlue(DE3) (Novagen, Madison, Wl), which is 
lysogenic for lambda DE3 (and carries the T7 RNA polymerase gene under lacUV5 

35 control). In addition, plasmid pKR274 contains a bacterial origin of replication (on) 
functional in E. coli from the yector pSP72 (Stratagene). 

Plasmid pKR274 was constructed in many steps from a number of different 
Intermediate cloning vectors. The Gy1/Maelo/legA2 cassette was released from 
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plasmid pKR270 by digestion with Ss/WI and Sbf\ and was cloned into the 
Bsi\N\/Sbfl sites of plasnnid pKR269, containing the delta-6 desaturase, the 
T7prom/hpt/T7term cassette and the bacterial on region. This was designated as 
plasmid pKR272. The KTi/Mad5/KTi3' cassette, released from pKR136 by digestion 
5 with Bs/WI, was then cloned into the Bsi\N\ site of pKR272 to give pKR274. A 
description for plasmid construction for pKR269, pKR270 and pKR136 is provided 
below. 

Plasmid pKR159 (described in Example 2) was digested with Noil to release 
the M. alpina delta-6 desaturase. which was, in turn, cloned into the Not\ site of the 

10 soybean expression vector pKR197 to give pKR269. Vector pKR197 contains a 
pcon/Nof//Phas3' cassette, the T7prom/hptnTterm cassette and the bacterial ori 
region. Vector pKR197 was constructed by combining the Asc\ fragment from 
plasmid pKS102 (WO 02/00905), containing the T7prom/hpt/T7term cassette and 
bacterial on, with the >Ascl fragment of plasmid pKR72, containing the 

15 pcon/A/of//Phas cassette. Vector pKR72 is identical to the previously described 

vector pKS123 (WO 02/08269). except that Sdfl, Fsel and Bsl\N\ restriction enzyme 
sites were introduced between the HindlW and BamHI sites in front of the p- 
conglycinin promoter. 

The gene for the M. alpina elongase was PCR-amplified (described in 

20 Example 3) digested with Natl and cloned into the Not\ site of vector pKR263 to give 
pKR270. Vector pl<R263 contains a Noti site flanked by the promoter for the 
glycininGyl gene and the leguminA2 3' transcription termination region 
(Gy1/A/of//legA2 cassette). In addition, the Gy1/Afofl/iegA2 cassette was flanked by 
Sbf\ and SsA/VI sites. Vector pKIR263 was constructed by combining the Pstt/Nott 

25 fragment from plasmid pKR142, containing the l6guminA2 3' transcription 

.termination region, an ampicillin resistance gene arid bacterial ori with the PstUNotl 
fragment of plasmid pSGIy12, containing the glycininGyl promoter. The 
glycininGyl promoter was amplified from pZBL119 (described in Example 2) using 
primer oSGIy-1 (SEQ ID NO:59), designed to introduce an Sbf\/Pst\ site at the 5' 

30 end of the promoter, and primer oSGIy-2 (SEQ ID NQ:60), designed to introduce a 
NofI site at the 3' end of the promoter. 

TTCCTGCAGGCTAGCCTAAGTACGTACTC (SEQ ID NO:59) 

35 AAGCGGCCGCGGTGATGACTG (SEQ ID NO:60) 



The resulting PGR fragment was subcloned into the intermediate cloning 
vector pCR-Script AMP SK(+) (Stratagene) according the manufacturer's protocol to 
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give plasmid pSGIy12. Construction of pKR142, containing the legA2/A/ofI/legA23' 
cassette is described in Example 2. The gene for the M alpina delta-5 desaturase 
was PCR-amplified as described in Example 3, digested with Notl and cloned into 
the Notl site of vector pKR124 (described in Example 2) to give pKR136. 
5 EXAMPLE 5 

Assemblino EPA biosvnthetic pathwav cenes for expression in Som atic Sovbean' 

Embryos and Sovbean Seeds (pKKE2) 
The S. diclina delta-6 desaturase, M. alpina elongase and M. alpina delta-5 
desaturase were cloned into plasmid pKKE2 (Figure 4) behind strong, seed-specific 
10 promoters allowing for high expression of these genes in somatic soybean embryos 
and soybean seeds. Plasmid pKKE2 was identical to pKR274, described in 
Example 4, except that in pKKE2 the M alpina delta-6 desaturase was replaced 
with the S. diclina delta-6 desaturase. As in pKR274, the S. diclina delta-6 
desaturase was cloned behind the promoter for the a' subunit of p-conglyclnin 
15 followed by the 3' transcription termination region of the phaseolin gene 
(pcon/SddB/PhasS* cassette). 

Plasmid pKKE2 was constructed from a number of different intermediate 
cloning vectors as follows: The pcon/Sdd6/Phas3' cassette was released from 
plasmid pKS208 (described in Example 2) by digestion with HindWl and was cloned 
20 into the HindWl site of plasmid pKR272 (Example 3) to give pKR301 . The 

KTi/Mad5/KTi3' cassette, released from pKR136, (Example 4) by digestion with 
Ss/WI. was then cloned Into the Bs/WI site of pKR301 to give pKKE2. 

EXAMPLE 6 

Cloning of S.diclina (ATCC 56851 > delta-17 desaturase 
25 Construction of Saproleania diclina (ATCC 56851^ cDNA Library 

To isolate genes encoding for fun ctiona l desaturase enzymes, a^cDNA library 
was constructed. Saprolegnia diclina cultures were grown in potato dextrose media 
(Difco # 336. BD Diagnostic Systems, Sparks, MD) at room temperature for four 
days with constant agitation. The mycelia were harvested by filtration through 
30 several layers of cheesecloth, and the cultures were crushed in liquid nitrogen using 
a mortar and pestle. The cell lysates were resuspended in RT buffer (Qiagen, 
Valencia, California) containing p-mercaptoethanol and incubated at 55'C for 
three minutes. These lysates were homogenized either by repeated aspirations 
through a syringe or over a "Qiashredder^'-brand column (Qiagen). The total RNA 
35 was finally purified using the "RNeasy Maxi'-brand kit (Qiagen), as per the 
manufacturer's protocol. 

mRNA was isolated from total RNA from each organism using an oligo dT 
cellulose resin. The "pBluescript II XR"-brand library construction kit (Stratagene, La 
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Jolla, CA) was used tq synthesize double-stranded cDNA. The double-stranded 
cDNA was then directionally cloned (5* EcoRI/3' Xhol) into pBluescript li SK(+) 
vector (Stratagene). The S. diclina library contained approximately 2.5 x 10^ clones, 
each with an average insert size of approximately 700 bp. Genomic DNA of S. 
5 diclina was isolated by crushing the culture in liquid nitrogen followed by purification 
using the "Genomic DNA Extraction"-brand kit (Qiagen), as per the manufacturer's 
protocol. 

Determination of Codon Usage in Saproleania diclina 

The 5' ends of 350 random cDNA clones were sequenced from the 

10 Saprolegnia diclina cDNA library described above. The sequences were translated 
into six reading frames using GCG program (Genetics Computer Group, Madison, 
Wl) with the "FastA"-brand algorithm to search for similarity between a query 
sequence and a group of sequences of the same type, specifically within the 
GenBank database. Many of the clones were identified as putative housekeeping 

15 genes based on protein homology to known genes. Eight S. diclina cDNA 
sequences were thus selected. Additionally, the full-length S. diclina delta 5- 
desaturase and delta e-desaturase sequences were also used (see Table 4 below). 
These sequences were then used to generate the S. diclina codon bias table shown 
in Table 2 below by employing the "CodonFrequency" program from GCG (Madison, 

20 Wl). 



TABLE 4 

GENES FROM Saprolegnia diclina USED IN CODON BIAS TABLE 



Clone 


Database Match 


# bases 


# amino 
acids 


3- 


Actin gene - - 


615 - 


205 


20 


Ribosomal protein L23 


420 


140 


55 


Heat Shock protein 70 gene 


468 


156 


83 


Glyceraldehyde-3-P-dehydrogenase 
gene 


588 


196 


138 


Ribosomai protein S1 3 gene 


329 


110 


179 


Alpha-tubulin 3 gene 


591 


197 


190 


Casein l<inase II alpha subunit gene 


627 


209 


250 


Cyclophilin gene 


489 


163 




Delta 6-desaturase 


1362 


453 
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Clone 


Database Match 


# bases 


# amino 
acids 




Delta 5-desaturase 


1413 


471 




Total 


6573 


2191 



TABLE 5 



CODON 



Amino acid 


Codon Bias 


% used 


Ala 


GCC 


55% 


Ara 


CGC 


50% 


Asn 


AAC 


94% 


ASD 


GAC 


85% 


Cvs 


TGC 


77% 


Gin 


CAG 


90% 


Glu 


GAG 


80% 


Gly 


GGC 


67% 


His 


CAC 


86% 


lie 


ATC 


82% 


Leu 


CTC 


52% 


Lys 


AAG 


87% 


Met 


ATG 


100% 


Phe 


TTC 


72% 


Pro 


- CCG 


55% 


Ser 


TCG 


47% 


Thr 


ACG 


46% 


Trp 


TGG 


100% 


Tyr 


TAC 


90% 


Val 


GTC 


73% 


Stop 


TGA 


67% 
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DpRi qn of Degenerate. Oligonucleotides for the Isolat ion of an Omeaa-3 Desaturase 
from Saproleania diclina (ATCC 56851) 

The method for identification of a delta-17 desaturase (an omega-3 
desaturase) gene from S. diclina involved PGR amplification of a region of the 
5 putative desaturase gene using degenerate oligonucleotides (primers) that 

contained conserved motifs present in other known omega-3 desaturases. Omega- 
3 desaturases from the following organisms were used for the design of these 
degenerate primers: Arabldopsis thaliana (Swissprot # P46310), Ricunus communis 
(Swissprot# P48619), Glycine max (Swissprot* P48621), Sesamum indicum 

10 (Swissprot # P48620), Nicotiana tabacum (GenBank # D79979), Perilla frutescens 
(GenBank # U59477), Capsicum annuum (GenBank # AF222989). Limnanthes 
douglassi (GenBank # U17063). and Caenorhabditis elegans (GenBank # L41807). 
Some primers were designed to contain the conserved histidine-box motifs that are 
important components of the active site of the enzymes. See Shanklin, J.E., 

15 McDonough, V.M., and Martin, C.E. (1994) Bioctiemistry 33, 12787-12794. 

Alignment of sequences was carried out using the CLUSTALW !\/lultiple 
Sequence Alignment Program (Thompson, J.D. et al. (1994) Nucl. Acids Res. 
22:4673-4680). 

The following degenerate primers were designed and used in various 
20 combinations: 

Protein IVIotif 1 : NH3- TRAAIPKHCWVK -COOH (SEQ ID NO:61 ) 

Primer RO 1144 (Fonward): 
25 ATCCGCGCCGCCATCCCCAAGCACTGCTGGGTCAAG (SEQ ID NO: 62) 

Protein Motif 2: NH3- ALFVLGHDCGHGSFS -COOH (SEQ ID NO:63) 

This primer contains the histidine-box 1 (underlined). 

30 Primer RO 1119 (Forward): ' 

GCCCTCTTCGTCCTCGGCCAYGACTGCGGCCAYGGCTCGTTCTCG 

(SEQ ID NO: 64). 

Primer RO 1 1 1 8 (Reverse): 
35 GAGRTGGTARTGGGGGATCTGGGGGAAGARRTGRTGGRYGACRTG 

(SEQ ID NO: 65). 
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Protein Motif 3: NH3- PYHGWRIS HRTHH QN -COOH (SEQ ID NO:66) 

This primer contains tlie iiistidine-box 2 (underlined). 

Primer RO 1 121 (Fonward): 
5 CCCTACCAYGGCTGGCGCATCTCGCAYCGCACCCAYCAYCAGAAC 

(SEQ ID NO: 67). , 
Primer RO 1 122 (Reverse): 

GTTCTGRTGRTGGGTCCGRTGCGAGATGCGCCAGCCRTGGTAGGG 
10 (SEQ ID NO: 68). 

Protein Motif 4: NH3- GSHF D/H P DA' SDLFV -COOH (SEQ ID NO:69) 

Primer RO 1 146 (Forward): 
15 GGCTCGCACTTCSACCCCKACTCGGACCTCTTCGTC (SEQ ID NO: 70). 

Primer RO 1 147 (Reverse): 

GACGAAGAGGTCCGAGTMGGGGTWGAAGTGGGAGCC (SEQ ID NO: 71). 
2!b Protein Motif 5: NH3- WS Y/F LTV RGGLTT L/l DR -COOH (SEQ ID NO:72) 
Primer RO 1148 (Reverse): 

GCGCTGGAKGGTGGTGAGGCCGCCGCGGAWGSACGACCA 
(SEQ ID NO: 73) 

25 

Protein Motif 6: NH3- HHDIG THVIHH LFPQ -COOH (SEQ ID NO:74) 
^his-seqaence contains-the third-histidine-box-(underlined): 

Primer RO 1114 (Reverse): 
30 CTGGGGGAAGAGRTGRTGGATGACRTGGGTGCCGATGTCRTGRTG 

(SEQ ID NO: 75). 

Protein Motif 7: NH3- H UF FP Q/K IPHYHL V/l EAT -COOH (SEQ ID NO:76) 

35 Primer RO 1116 (Reverse): 

GGTGGCCTCGAYGAGRTGGTARTGGGGGATCTKGGGGAAGARRTG 

(SEQ ID NO: 77). 

47 



wo 2004/071467 



PCT/US2004/005758 



Protein Motif 8: NH3- HVA/I HH L/F FPQIPHYHL ^COOH (SEQ ID NO:78) 

This primer contains tlie third histidine-box (underlined) and accounts for differences 

between the plant omege-3 desaturases and the C. elegans onnega-3-desaturase. 

The nucleic acid degeneracy code used for SEQ. ID. NOS: 62 through 77 was as 
follows. R= A/G; Y=C/T; M=A/C; K=G/T; W=Arr; S=C/G; B=C/GyT; D=A/G/T; 
H=A/C/T; V=A/C/G; and N=A/C/GyT. 

Identification and Isolation of Delta-17 Desaturase Gene from Saproleania diclina 
fATCC 56851) 

Various sets of the degenerate primers above were used in PGR 
amplification reactions, using as a template either the S. diclina cDNA library 
plasmld DISIA, or S. diclina genomic DNA. Also various different DNA polymerases 
and reaction conditions were used for the PGR amplifications. These reactions 
variously involved using "Platinum Taq"-brand DNA polymerase (Life Technologies 
Inc., Rockville, MD), or cDNA polymerase (Glontech, Palo Alto, GA), or Taq PGR- 
mix (Qiagen), at differing annealing temperatures. 

PGR amplification using the primers RO 1121 (Fonward) (SEQ. ID. NO:67) 
and RO 1 1 16 (Reverse) (SEQ. ID. NO:77) resulted in the amplification of a fragment 
homologous to a icnown omega-3 desaturase. The RO 1121 (Forward) primer 
corresponds to the |!>rotein motif 3; the RO 1116 (Reverse) primer corresponds to 
the protein motif 7. 

PGR amplification was carried out in a 50 ^il total volume containing: 3 |jI of 
the cDNA library template, PGR buffer containing 40 mM Tricine-KOH (pH 9.2), 
15 mM KOAc, 3.5 mM Mg(OAc)2, 3.75 \^g/ml BSA (final concentration), 200 pM 
each deoxyribonucleotide triphosphate, 10 pmole of each primer and 0.5 pi of 
"Advantage"-brand cDNA polymerase (Glontech). Amplification was carried out as 
follows: initial denaturation at 94"G for 3 minutes, followed by 35 cycles of the 
following: 94°G for 1 min, 60°G for 30 sec, 72'G for 1 min. A final extension cycle of 
72''G for 7 min was carried out, followed by reaction termination at 4^*0. 

A single -480 bp PGR band was generated which was resolved on a 1 % 
"SeaKem Gold"-brand agarose gel (FMC BioProducts, Rockland, ME), and gel- 
purified using the Qiagen Gel Extraction Kit. The staggered ends on the fragment 
were Tilled-in" using T4 DNA polymerase (Life Technologies, Rockville. MD) as per 
the manufacturer's instructions, and the DNA fragments were cloned into the PGR- 
Blunt vector (Invitrogen, Garlsbad, CA). The recombinant plasmids were 
transformed into TOP10 supercompetent cells (Invitrogen), and eight clones were 
sequenced and a database search (Gen-EMBL) was canried out. 
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Clones "sdd17-7-r to "sdd17-7-8' were all found to contain and -483 bp 
insert. The deduced amino acid sequence from this fragment showed highest 
identity to the following proteins (based on a "tFastA" search): 

1. 37.9% identity in 161 amino acid overlap with an omega-3 (delta-15) 
5 desaturase from Synechocystis sp. (Accession # D13780). 

2. 40.7% identity in 1 13 amino acid overlap with Picea abies plastidic 
omega-3 desaturase (Accession # AJ302017). 

3. 35,9% identity In 128 amino acid overlap with Zea mays FAD8 fatty acid 
desaturase (Accession # D63953). 

10 - Based on its sequence homology to known omega-3 fatty acid desaturases, it 

seemed likely that this DNA fragment was part of a delta-17 desaturase gene 
present in S. diclina. 

The DNA sequence identified above was used in the design oligonucleotides 
to isolate the 3' and the 5' ends of this gene from the S. diclina cDNA library. To 

15 isolate the 3' end of the gene, the following oligonucleotides were designed: 

RO 1188 (Fonward): 5'-TACGCGTACCTCACGTACTCGCTCG-3' (SEQ ID NO: 79) 

RO 1 189 (FoHAfard): TTCTTGCACCACAACGACGAAGCGACG (SEQ ID NO: 80) 

RO 1190 (Fonward): GGAGTGGACGTACGTCAAGGGCAAC (SEQ ID NO: 81) 

RO 1191 (Fonrt^ard): TCAAGGGCAACCTGTCGAGCGTCGAC (SEQ ID NO: 82) 

25 These primers (SEQ ID NOS: 79-82) were used in combinations with the 

pBluescript SK(+) vector oligonucleotide: 

RO 898: 5'-CCCAGTCACGACGTGTAAAA CGACGGCCAG-3' (SEQ ID NO: 83). 

PGR amplifications were carried out using either the "Taq PGR Master Mix" 
brand polymerase (Qiagen) or "Advantage"-brand cDNA polymerase (Glontech) or 
30 "Platinum'-brand Taq DNA polymerase (Life Technol^ogies), as follows: 

For the "Taq PGR Master Mix" polymerase, 10 pmoles of each primer were 
used along with 1 pi of the cDNA library DNA from Example 1 . Amplification was 
carried out as follows: initial denaturation at 94**G for 3 min, followed by 35 cycles of 
the following: 94^*0 for 1 min, 60**G for 30 sec, 72*^0 for 1 min. A final extension 
35 cycle of 72°G for 7 min was carried out, followed by the reaction tennination at 4*'G. 
This amplification resulted in the most distinct bands as compared with the other 
two conditions tested. 
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For the "Advantage"-brancl cDNA polymerase reaction, PGR amplification 
was carried out in a 50 pi total volume containing: 1 \s\ of tlie cDNA library template 
from Example 1. PGR buffer containing 40 mM Tricine-KOH (pH 9.2). 15 mM KOAc, 
3.5 mM Mg(OAc)2. 3.75 |jg/ml BSA (final concentration), 200 \iM each 
deoxyribonucleotide triphosphate. 10 pmote of each primer and 0.5 pi of cDNA 
polymerase (Clontech). Amplification was carried out as described for the Taq PGR 
Master Mix. 

For the "Platinum"-brand Taq DNA polymerase reaction, PGR amplification 
was carried out in a 50 pi total volume containing: 1 \^\ of the cDNA library template 
from Example 1, PGR buffer containing 20 mM Tris-CI. pH 8.4, 50 mM KCl (final 
concentration). 200 pM each deoxyribonucleotide triphosphate. 10 pmole of each 
primer, 1.5 mM MgS04, and 0.5 [i\ of Platinum Taq DNA polymerase. Amplification 
was carried out as follows: Initial denaturation at 94°''G for 3 min. followed by 
30 cycles of the following: 94°G for 45 sec, 55°C for 30 sec, 68°G for 2 min. The 
reaction was terminated at 4°C. 

All four sets of primers in combination with the vector primer generated 
distinct bands. PGR bands from the combination (RO 1 188 + RO 898) were 
>500 bp and this was gel-purified and cloned separately. The PGR bands • 
generated from the other primer combinations were <500 bp. The bands were gel- 
purified. pooled together, and cloned into PGR-Blunt vector (Invitrogen) as 
described eariier. The recombinant plasmids were transformed into TOP10 
supercompetent cells (Invitrogen) and clones were sequenced and analyzed. 

Clone '"sdd 17-1 6-4" and "sdd16-6" containing the -500 bp insert, and clones 
"sdd17-17-6," ''sdd17-17-10," and "sdd17-20-3." containing the -400 bp inserts, 
were all found to contain the 3'-end of the putative delta-17 desaturase. These 
sequences overlapped with each other, as well as with the originally identified 
fragment of this putative omega-3 desaturase gene. All of the sequences contained 
the TAA' stop codon and a poly-A tail typical of 3'-ends of eukaryotic genes. This 
3'-end sequence was homologous to other known omega-3 desaturases, sharing 
the highest identity (41 .5% In 1 30 amino acid overiap) with the Synechocystis delta- 
15 desaturase (Accession # D13780). 

For the isolation of the 5'-end of the this gene, the following oligonucleotides 
were designed and used in combinations with the pBluescript SK(+) vector 
oligonucleotide: 

RO 899: 5*- AGGGGATAAGAATTTGAGAGAGGAAACAGG -3' (SEQ ID NO: 84) 

RO 1185 (Reverse):GGTAAAAGATCTGGTGGTTGTGGATGTTGC (SEQ ID NO: 
85). 

50 



wo 2004/071 467 PCTAJS2004/005758 



RO 1186 (Reverse): 5'-GTCAAAGTGGCTCATCGTGC-3' (SEQ ID NO: 86) 

RO 1187 (Reverse): CGAGCGAGTACGTGAGGTACGCGTAC (SEQ ID NO: 87) 

5 ' \ 

Amplifications were carried out using either the "Taq PGR Master Mix"-brand 
polymerase (Qiagen) or the "Advantage"-brand cDNA polymerase (Clontech) or the 
"Platinum"-brand Taq DNA polymerase (Life Technologies), as described 
hereinabove for the 3' end isolation. 

10 PGR bands generated from primer combinations (RO 1185 or RO 1 186 + RO 

899) were between -580 to -440 bp and these were pooled and cloned into PGR- 
Blunt vector as described above, Glones thus generated included "sddl 7-14-1 
"sdd 17-1 4-10," "sdd 17-1 8-2," and "sdd17-18-8." all of which showed homology with 
known omega-3 desaturases. 

15 Additionally, bands generated from (RO 1187 + RO 899) were - 680 bp. and 

these were cloned separately to generate clones "sdd 17-22-2" and "sdd 17-22-5" 
which also showed homology with known omega-3 desaturases. All these clones 
overiapped with each other, as well as with the original fragment of the unknown 
putative delta-17 desaturase. These sequences contained an 'ATG' site followed by 

2'b an open reading frame, indicating that it is the start site of this gene. These 

fragments showed highest identity (39.7% in 146 amino acid overiap) with the delta- 
15 desaturase from Calendula officinalis (Accession # AJ245938). 

The full-length reading frame for this delta-17 desaturase was obtained by 
PGR amplification of the S. diclina cDNA library using the following oligonucleotides: 

25 

RO 1212 (Forward): 

-5^-TGAAGAGAATTGATGAGGGAGGATAAGAGGAAGGTGGAGTTGGGG-3' 
(SEQ ID NO: 88) 

30 This primer contains the 'ATG' start site (single underiine) followed by the 5' 

sequence of the omega-3 desaturase. In addition, an EcoRI site (double underiine) 
was introduced upstream of the start site to facilitate cloning into the yeast 
expression vector pYX242. 

35 RO 1213 (Reverse): 

5VAAAAGAAAGGTTGGGTTGGTAGTGIIAGTGGGAGTTGGGCTTGGG-3' 

(SEQ ID NO: 89) 
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This primer contains the TAA' stop codon (single underline) of the gene as well as 
sequence downstream from the stop codon. This sequence was included because 
regions within the gene were very G+C rich, and thus could not be included In the 
design of oligonucleotides for PGR amplification. In addition, a H/ndll! site (double 
underline) was included for convenient cloning into a yeast expression vector 
pYX242. 

PGR amplification was carried out using the "Taq PGR Master Mix"-brand 
polymerase (Qiagen), 10 pmoies of each primer, and 1 |jI of the cDNA library DNA 
from Example 1. Amplification was carried out as follows: initial denaturation at 
94°C for 3 min, followed by 35 cycles of the following: 94°G for 1 min, eO'C for 30 
sec, 72°C for 1 min. A final extension cycle of 72°C for 7 min was carried out, 
followed by the reaction termination at 4°C. 

A PGR band of ~1 kb was thus obtained and this band was isolated, purified, 
cloned into PGR-Blunt vector (Invitrogen). and transfonned into TOP10 cells. The 
inserts were sequenced to verify the gene sequence. Glone "sdd 17-27-2" was 
digested with EcoRI / H/ndlll to release the full-length insert, and this insert was 
cloned into yeast expression vector pYX242. previously digested with EcoRI / 
H/ndlll. This construct contained 1077 bp of sdd17 cloned into pYX242, This 
construct was labeled pRSP19. 

EXAMPLE 7 

Assembly of EPA biQsvnthetic pathway genes for exp ression in Somatic Soybean 
Embryos and Soybean Seeds fDKR275) 
The Ambidopsis Fad3 gene [Yadav. N.S. et al. (1993), Plant Physiol. 
103:467-761 and S. diclina deIta-17 desaturase were cloned into plasmid pKR275 
(Figure 5) behind strong, seed-specific promoters allowing for high expression of 
tjiesegmesjn sonTaticAoybean_ejTibryos and soybean seeds. The Fad3 gene 
SEQ ID NO:47, and its protein translation product in SEQ ID NO:48, was cloned 
behind the KTi promoter, and upstream of the KTI 3* termination region 
(KTi/Fad3/KTi3' cassette). The S. diclina delta-1.7 desaturase was cloned behind 
the soybean annexin promoter followed by the soy BP30 3' tennination region 
(Ann/Sdd17/BD30 cassette). Plasmid pKR275 also contains a mutated fomn of the 
soy acetolactate synthase (ALS) that is resistant to sulfonylurea herbicides. ALS 
catalyzes the first common step in the biosynthesis of the branched chain amino 
acids isoleucine, leucine, and valine (Keeler et al. Plant Physiol 1993 102: 1009-18). 
Inhibifion of native plant ALS by several classes of structurally unrelated hertsicides 
including sulfonylureas, imidazolinones, and triazolopyrimidines, is lethal (Ghong 
GK, Choi JD Biochem Biophys Res Commun 2000 279:462-7). Overexpression of 
the mutated sulfonylurea-resistant ALS gene allows for selection of transformed 
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plant cells on sulfonylurea herbicdes. The ALS gene is cloned behind the SAMS 
promoter (described in WO 00/37662). This expression cassette is set forth in SEQ 
ID NO:90. In addition, plasmid pKR275 contains a bacterial ori region and the 
T7prom/HPT/T7term cassette for replication and selection of the plasmid on 
5 hygromycin B in bacteria. 

Plasmid pKR275 was constructed from a number of different intermediate 
cloning vectors as follows: The KTi/Fad3/KTi3' cassette was released from plasmid 
pKR201 by digestion with Bs/WI and was cloned into the BsN\l\ site of plasmid 
pKR226, containing the ALS gene for selection, the T7prom/hpt/T7term cassette 

10 and the bacterial ori region. This was designated plasmid pKR273. The 

Ann/Sdd17/BD30 cassette, released from pKR271 by digestion with Pstl, was then 
cloned into the Sbfl site of pKR273 to give pKR275. A detailed description for 
plasmid construction for pKR226, pKR201 and pKR271 is provided below. 

Plasmid pKR226 was constructed by digesting pKR218 with Bs/WI to remove 

15 the legA2/Notl/legA3' cassette. Plasmid pKR218 was made by combining the filled 
Hind\\\/Sbn fragment of pKR217, containing the legA2//\/ofl/IegA23' cassette, the 
bacterial ori and the T7prom/HPTyT7term cassette, with the Pstl/Smal fragment of 
pZSLISIeuB. containing the SAMS/ALS/ALS3' cassette. Plasmid pKR217 was 
constructed by cloning the BamHI/Hindlll fragment of pKR142 (described in 

26 Example 2), containing the legA2/A/ofl/legA23' cassette, into the eamHI/H/ndlll site 
of KS102. The Arabidopsis Fad3 gene was released from vector pKS131 as a Not\ 
fragment and cloned into the A/ofI site of pKR124 (described in Example 2) to fonn 
pKR201 . The Not\ fragment from pKS131 is identical to that from pCF3 [Yadav, 
N.S. et al (1993) Plant Physiol. 103:467«761) 

25 The gene for the S. diclina delta-17 desaturase was released from 

pRSP19/pGEM (described in Example 2) by partial digestion with Nott, and it was 
then cloned into the Noil site of pKR268 to give pKR271. Vector pKR268 contains 
a Not\ site flanked by the annexin promoter and the BD30 3' transcription 
termination region (Ann/A/of//BD30 cassette). In addition, the Ann/A/ofl/BD30 

30 cassette was flanked by Pst\ sites. I 

To construct pKR268, the annexin promoter from pJS92 was released by 
BamHI digestion and the ends were filled. The resulting fragment was ligated into 
the filled BsiWI fragment of pKR124 (described in Example 2), containing the 
bacterial ori and ampicillin resistance gene, to give pKR265. This cloning step 

35 added Sbfl, Psfl and Ss/WI sites to the 5' end of the annexin promoter. The 

annexin promoter was released from pKR265 by digestion with Sbfl and Nott and 
was cloned into the Sbfl/Not\ fragment of pKR256, containing the BD30 3' 
transcription terminator, an ampicillin resistance gene and a bacterial ori region, to 
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give pKR268. Vector pKR256 was constructed by cloning an EcoR\/Not\ fragment 
from pKR251r, containing the BD30 3' transcription terminator, into the EcoRUNotl 
fragment of intermediate cloning vector pKR227. This step also added a Pst\ site to 
the 3' end the BD30 3' transcription terminator. Plasmid pKR227 vvas derived by 
ligating the Sail fragment of pJS93 containing soy BD30 promoter (WO 01/68887) 
with the San fragment of pUC19. The BD30 3* transcription terminator was PCR- 
amplified from soy genomic DNAusing primer oSBD30-1 (SEQ ID NO:91), designed 
to introduce an Noti site at the 5' end of the terminator, and primer oSBD30-2 (SEQ 
ID NO:92), designed to introduce a Bs/WI site at the 3' end of the terminator. 

TGCGGCCGCATGAGCCG (SEQ ID NO:91) 

ACGTACGGTACCATCTGCTAATATTTTAAATC (SEQ ID NO:92) 

The resulting PGR fragment was subcloned into the intermediate cloning vector 
pCR-Script AMP SK(+) (Stratagene) according the manufacturer's protocol to give 
plasmid pKR251r. 

EXAMPLE 8 

Assembling EPA biosvnthetic oathwav genes for expression in Somatic Soybean 

Embrvos-DKR328 & pKR329 
The EPA biosynthetic genes were tested in combination in order to assess 
their combined activities in somatic soybean embryos before large-scale production 
transformation into soybean. Each gene was cloned into an appropriate expression 
cassette as described below. 

Plasmid pKR329 was similar to pKR275. described in detail in Example 4, in 
that-it-eontained-the same4<Ti/Fad3/KTi3' and Ann/Sddl 7/BD30 cassettes allowing 
for strong, seed specific expression of the Arabidopsis Fad3 and S. diclina delta17 
desaturase genes. It also contained the T7prom/HPT/T7term cassette and a 
bacterial or/. Plasmid pKR329 differed from pKR275 in that it contained the 
hygromycin phosphotransferase gene cloned behind the 35S promoter followed by 
the NOS 3' untranslated region (35S/HPT/NOS3' cassette) instead of the 
SAMS/ALS/ALS3' cassette. The 35S/HPT/NOS3' cassette allowed for selection of 
transformed plant cells on hygromycin-containing media. 

Plasmid pKR329 was constructed in many steps from a number of different 
intermediate cloning vectors. The KTi/Fad3/KTi3' cassette was released from 
plasmid pKR201 (Example 7) by digestion with Ss/WI and was cloned into the 
Ss/WI site of plasmid pKR325, containing the 35S/HPT/NOS3' cassette, the 
T7prom/hpt/T7term cassette and bacterial ori. This was called plasmid pKR327. 
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The Ann/Sdd17/BD30 cassette, released from pKR271 (Example 3) by digestion 
with Psfl, was then cloned into the Sbf\ site of pKR327 to give pKR329. Plasmid 
pKR325 was generated from pKR72 (Example 4) by digestion with H/ndlll to 
remove the pcon/A/ofl/Phas3' cassette. 
5 Plasmid pKR328 was identical to pKR329, described above, except that it did 

not contain the KTi/Fad3/KTi3' cassette. The Ann/Sdd17/BD30 cassette, released 
from pKR271 (Example 3) by digestion with Psfl, was cloned into the Sbf\ site of 
pKR325 (described above) to give pKR328. 

EXAMPLE 9 

10 Transformation of Somatic Sovbean Embrvo Cultures 

Culture Conditions 

Soybean embryogenic suspension cultures (cv. Jacl<) were maintained in 

35 ml liquid medium SB196 (see recipes below) on rotary shaker, 150 rpm, 26''C 

with cool white fluorescent lights on 16:8 hr day/night photoperiod at light intensity of 
15 60-85 |jE/m2/s. Cultures are subcultured every 7 days to two weeks by inoculating 

approximately 35 mg of tissue into 35 ml of fresh liquid SB196 (the preferred 

subculture interval is every 7 days). 

Soybean embryogenic suspension cultures were transformed with the 

plasmids and DNA fragments described in the following examples by the method of 
20 particle gun bombardment (Klein et al. 1987; Nature, 327:70). A DuPont Biolistic 

PDS1000/HE instrument (helium retrofit) was used for all transformations. 

Sovbean Embryogenic Suspension Culture Initiation 

Soybean cultures were Initiated twice each month with 5-7 days between 

each initiation. 

25 Pods with immature seeds from available soybean plants 45-55 days after 

planting were picked, removed from their shells and placed into a sterilized magenta 
box^ The soybean seedswere sterilizecTby sTiakinglhem fo minufeis in a 5% 
Clorox solution with 1 drop of ivory soap (95 ml of autoclaved distilled water plus 
5 ml Clorox and 1 drop of soap). Mix well. Seeds were rinsed using 2 1-liter bottles 

30 of sterile distilled water and those less than 4 mm were placed on individual 

microscope slides. The small end of the seed was cut and the cotyledons pressed 
out of the seed coat. Cotyledons were transferred to plates containing 881 medium 
(25-30 cotyledons per plate). Plates were wrapped with fiber tape and stored for 
8 weeks. After this time secondary embryos were cut and placed into SB196 liquid 

35 media for 7 days. 

Preparation of DNA for Bombardment 

Either an intact plasmid or a DNA plasmid fragment containing the genes of 
interest and the selectable marker gene was used for bombardment Plasmid DNA 
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for bombardment was .routinely prepared and purified using the method described In 
the Promega™ Protocols and Applications Guide, Second Edition (page 106). 
Fragments of pKR274 (Example 4). pKKE2 (Example 5) and pKR275 (Example 7) 
were obtained by gel isolation of double digested plasmids. In each case, 100 ug of 
5 plasmid DNA was digested in 0.5 ml of the specific enzyme mix described below. 
Plasmid pKR274 (Example 4) and pKKE2 (Example 5) were digested with >Ascl 
(100 units) and EcoRI (100 units) in NEBuffer 4 (20 mM Tris-acetate, 10 mM 
magnesium acetate, 50 mM potassium acetate, 1 mM dithiothreitol, pH 7.9), 
100 ug/ml BSA, and 5 mM beta-mercaptoethanoi at 37*^0 for 1.5 hr. Plasmid 

10 pKR275 (Example 7) was digested with Asc\ (100 units) and Sgfi (50 units) in 
NEBuffer 2 (10 mM Tris-HCI, 10 mM MgCl2, 50 mM NaCI, 1 mM dithiothreitol, 
pH 7.9), 100 ug/ml BSA, and 5 mM beta-mercaptoethanol at 37°C for 1 .5 hr. The 
resulting DNA fragments were separated by gel electrophoresis on 1% SeaPlaque 
GTG agarose (BioWhitaker Molecular Applications) and the DNA fragments 

1 5 containing EPA biosynthetic genes were cut from the agarose gel. DNA was 
purified from the agarose using the GELase digesting enzyme following the 
manufacturer's protocol. 

A 50 pi aliquot of sterile distilled water containing 3 mg of gold particles (3 mg 
gold) was added to 5 pi of a 1 pg/M' DNA solution (either intact plasmid or DNA 

20 fragment prepared as described above), 50 pi 2.5M CaCl2 and 20 pi of 0.1 M 

spermidine. The mixture was shaken 3 min on level 3 of a vortex shaker and spun 
for 10 sec In a bench microfuge. After a wash with 400 pi 100% ethanol the pellet 
was suspended by sonication in 40 pi of 100% ethanol. Five pi of DNA suspension 
was dispensed to each flying disk of the Biolistic PDS1000/HE instrument disk. 

25 Each 5 pi aliquot contained approximately 0.375 mg gold per bombardment (i.e. per 
disk),^ 

Tissue Preparation and Bombardment with DNA 

Approximately 150-200 mg of 7 day old embryonic suspension cultures were 
placed In an empty, sterile 60 x 15 mm petri dish and the dish covered with plastic 
30 mesh. Tissue was bombarded 1 or 2 shots per plate jwith membrane rupture 

pressure set at 1 100 PSl and the chamber evacuated to a vacuum of 27--28 inches 
of mercury. Tissue was placed approximately 3.5 inches from the retaining / 
stopping screen. 

Selection of Transformed Embryos 
35 Transfonned embryos were selected either using hygromycin (when the 

hygromycin phosphotransferase, HPT, gene was used as the selectable marker) or 
chlorsulfuron (when the acetolactate synthase, ALS, gene was used as the 
selectable marker). 
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Hvqromvcin (HPT) Selection 

Following bombardment, the tissue was placed into fresh SB196 media and 
cultured as described above. Six days post-bombardment, the SB196 is exchanged 
with fresh SB196 containing a selection agent of 30 mg/L hygromycin. The 
5 selection media is refreshed weekly. Four to six weeks post selection, green, 
transformed tissue may be observed growing from untransformed, necrotic 
embryogenic clusters. Isolated, green tissue was removed and inoculated into 
multiwell plates to generate new, clonally propagated, transformed embryogenic 
suspension cultures. 

10 Chlorsulfuron (ALS) Selection 

Following bombardment, the tissue was divided between 2 flasks with fresh 
SB196 media and cultured as described above. Six to seven days post- 
bombardment, the SB196 was exchanged with fresh SBig6 containing selection 
agent of 100 ng/ml Chlorsulfuron. The selection media was refreshed weekly. Four 

15 to six weeks post selection, green, transformed tissue may be observed growing 
from untransformed. necrotic embryogenic clusters. Isolated, green tissue was 
removed and inoculated into multiwell plates containing SB196 to generate new, 
clonally propagated, transformed embryogenic suspension cultures. 
Regeneration of Sovbean Somatic Embrvos into Plants 

2!b In order to obtain whole plants from embryogenic suspension cultures, the 

tissue must be regenerated. 
Embryo Maturation 

Embryos were cultured for 4-6 weeks at 26°C in SB196 under cool white 
fluorescent (Phillips cool white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 

25 Agro) bulbs (40 watt) on a 16:8 hr photoperiod with light intensity of 90-120 uE/m2s. 
After this time embryo clusters were removed to a solid agar media, SB166, for 
— 1-2'weeks.— eiusters'were then subcultured to medium SB103 for 3 weeks. During 
this period, individual embryos can be removed from the clusters and screened for 
alterations in their fatty acid compositions as described in Example 11. It should be 

30 noted that any detectable phenotype, resulting from the expression of the genes of 
interest, could be screened at this stage. This would include, but not be limited to, 
alterations in fatty acid profile, protein profile and content, carbohydrate content, 
growth rate, viability, or the ability to develop normally into a soybean plant. 
Embrvo Desiccation and Germination 

35 Matured individual embryos were desiccated by placing them into an empty, 

small petri dish (35 x 10 mm) for approximately 4-7 days. The plates were sealed 
with fiber tape (creating a small humidity chamber). Desiccated embryos were 
planted into SB71-4 medium where they were left to germinate under the same 
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culture conditions described above. Germinated plantlets were removed from 
germination medium and rinsed thorouginly with water and then planted in Redi- 
Earth in 24-cell pack tray, covered with clear plastic dome. After 2 weeks the dome 
was removed and plants hardened off for a further week. If plantlets looked hardy 
they were transplanted to 10" pot of Redi-Earth with up to 3 plantlets per pot. After 
10 to 16 weeks, mature seeds were harvested, chipped and analyzed for fatty acids 
as described in Examples 10 and 11. 
Media Recipes 

SB 196 - FN Lite liquid proliferation med ium foer liter) - 



MS FeEDTA - 100x Stock 1 


10 ml 


MS Sulfate - lOOx Stock 2 


10 ml 


FN Lite Halides - 100x Stock 3 


10 ml 


FN Lite P,B,Mo - 100x Stock 4 


10 ml 


85 vitamins (Iml/L) 


1.0 ml 


2.4-D (lOmg/L final concentration) 


1.0 ml 


KN03 


2.83 gm 


(NH4 )2 SO 4 


0.463 gm 


Asparagine 


1.0 gm 


Sucrose (1%) 


10 gm 



pH 5.8 
FN Lite Stock Solutions 



Stock # 
1 



1000ml 



500ml 



2 



MS Fe EDTA 100x Stock 

Na2 EDTA* 

FeS04 - 7H2O 

* Add first, dissolve ii 
MS Sulfate lOOx stock 



3.724 g 1.862 g 

2.784 g 1.392 g 

in dark bottle while stirring 



MgS04 - 7H2O 
MnS04 - H2O 
ZnS04 - 7H2O 
CUSO4 - 5H2O 



37.0 g 
1.69 g 
0.86 g 



18.5 g 
0.845 g 
0.43 g 



0.0025 g 



0.00125 g 



3 



FN Lite Halides 100x Stock 



CaCl2 - 2H2O 
Kl 



C0CI2 - 6H2O 
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30.0 g 
0.083 g 
0.0025 g 



15.0 g 
0.0715 g 
0.00125 g 



I 



wo 2004/071467 PCT/US2004/005758 

4 FN Lite P,B,Mo 100x Stock 

KH2PO4 18.5 g 9.25 g 

H3BO3 0.62 g 0.31 g 

Na2Mo04 - 2H2O 0.025 g 0.01 25 g 

5 

SB1 solid medium (per literV 

1 pkg. MS salts (Gibco/ BRL - Cat# 1 1 1 17-066) 
1 ml 85 vitamins 1000X stock 
31.5 g sucrose 

10 2 ml 2.4-D (20mg/L final concentration) ^ 

pH5.7 
8 g TC agar 

SB 166 solid medium (per liter) - 
15 1 pkg. MS salts (Gibco/ BRL - Cat# 1 1 1 17-066) 

1 ml B5 vitamins 1000X stock 
60 g maltose 

750 mg MgCI2 hexahydrate 
5 g activated charcoal 
2^ pH5.7 

2 g gelrite 

SB 103 solid medium (per liter) — 

1 pkg. MS salts (Gibco/BRL - Cat# 1 1 1 17-066) 
25 1 ml B5 vitamins 1000X stock 

60 g maltose 

750 mg-MgCI2~hexahydrate ~ ~ 

pH 5.7 

2 g gelrite 

30 

SB 71-4 solid medium (per liter) — I 

1 bottle Gamborg's B5 salts w/ sucrose (Gibco/BRL - Cat# 211 53-036) 

pH5.7 

5 g TC agar 

35 

2.4-D stock 

- obtained premade from Phytotech cat# D 295 — concentration is 1 mg/ml 
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B5 Vitamins Stock (per 100 mH - stc c alignots at -20C 
10 g myo-inositol 
100 mg nicotinic acid 
100 mg pyridoxine HCi 
1 g tliiamlne 

If the solution does not dissolve quickly enough, apply a low level of heat via 
the hot stir plate. 



Chlorsulfuron Stock 

-Img / ml In 0.01 N Ammonium Hydroxide 

EXAMPLE 10 

Analysis of Somatic Soy Embrvos c ontaining various promoters driving M. alpina 

delta-6 desaturase 
Mature somatic soybean embryos are a good model for zygotic embryos. 
While in the globular embryo state in liquid culture, somatic soybean embryos 
contain very low amounts of triacylglycerol or storage proteins typical of maturing, 
zygotic soybean embryos. At this developmental stage, the ratio of total 
triacyiglyceride to total polar lipid (phospholipids and glycolipid) is about 1:4, as is 
typical of zygotic soybean embryos at the developmental stage from which the 
somatic embryo culture was initiated. At the globular stage as well, the mRNAs for 
the prominent seed proteins, a'-subunit of p-conglycinin, kunitz trypsin Inhibitor 3. 
and seed lectin are essentially absent. Upon transfer to hormone-free media to 
allow differentiation to the maturing somatic embryo state, triacylglycerol becomes 
the most abundant lipid class. As well, mRNAs for a'-subunit of p-conglycinin, 
kunitz trypsin inhibitor 3 and seed lectin become very abundant messages in the 
total mRNA population. On this basis somatic soybean embryo system behaves 
very similarly to maturing zygotic soybean embryos in vivo, and is therefore a good 
and rapid model system for analyzing the phenotypic effects of modifying the 
expression of genes in the fatty acid biosynthesis pathway. Most importantly, the 
model system is also predictive of the fatty acid composition of seeds from plants 
derived from transgenic embryos. 

Transgenic somatic soybean embryos containing the M alpina delta-6 
desaturase expression vectors described in Example 2 were prepared using the 
methods described In Example 9. Fatty acid methyl esters were prepared from 
single, matured, somatic soy embryos by transesterification. Embryos were placed 
in a vial containing 50 pL of trimethylsulfonium hydroxide (TMSH) and 0.5 mL of 
hexane and were incubated for 30 minutes at room temperature while shaking. 
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Fatty acid methyl esters (5 pL injected from hexane layer) were separated and 
quantified using a Hewlett-Packard 6890 Gas Chromatograph fitted with an 
Omegawax 320 fused silica capillary column (Supeico Inc., Cat#24152). The oven 
temperature was programmed to hold at 220^C for 2.7 min, increase to 240°C at 
5 20''C /min and then hold for an additional 2.3 min. Carrier gas was supplied by a 
Whatman hydrogen generator. Retention times were compared to those for methyl 
esters of standards commercially available (Nu-Chek Prep, Inc. catalog #U-99-A). 
The amount of GLA accumulated in embryo tissue was used as an indicator of the 
strength of each individual promoter. As indicated in Table 6, all of the promoters 
10 tested were capable of driving expression of the M alpina delta-6 desaturase. 

TABLE 6 

GLA Accumulation in Soybean Somatic Enibryos: M. alpina delta-6 desaturase 
gene linked to various promoters 

15 



Promoter 


GLA (% fatty 
acid) 


Soy a -subunit p- 


40+ 


conglycinin 




Soy KTi 3 


40+ 


Soy Annexin 


40 


Soy Glycinin 1 


35 


Soy 2S albumin 


22 


Pea Legumin A1 


10 


Soy P'-subunit p- 


9 


conglycinin 




Soy BD30 


_ 


Pea Legumin A2 


3 



EXAMPLE 11 

Analysis of transgenic Somatic Sov Embrvos and Seed Chios containing EPA 

Biosvnthetic Genes 

20 Transgenic somatic soybean embryos containing the expression vector 

pKR275 (Example 7) and either pKR274 (Example 4) or pKKE2 (Example 5) were 
prepared using the methods described in Example 9. 

A portion of the somatic soy embryos from each line generated was 
harvested and analyzed for fatty acid composition by GC as described in Example 
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10. Approximately 10 embryos were analyzed for each individual transformation 
event. Fatty acids were identified by comparison of retention times to those for 
authentic standards. In this way, 471 events were analyzed for pKR274/pKR275 
and 215 events were analyzed for pKKE/pKR275. From the 471 lines analyzed for 
5 pKR274/pKR276, 10 were identified that produced EPA (average of 10 individual 
embryos) at a relative abundance greater than 7% of the total fatty acids. The best 
line analyzed averaged 9% EPA with the best embryo of this line having 13% EPA. 
From the 215 lines analyzed for KKE/KR275, 1 1 lines were identified that produced 
EPA (average of 10 individual embryos) at a relative abundance greater than 9% of 

10 the total fatty acids. The best line analyzed averaged 13% EPA with the best 
embryo of this line having 16% EPA. The best EPA-producing events from each 
construct set are shown in Table 7. In Table 7, clones 3306-2-3 to 3324-1-3 are 
pKR274/pKR275 events and 3338-6-3 to 3338-6-24 are pKKE2 events. Fatty acids 
in Table 7 ar defined as X:Y where X is the fatty acid chain length and Y Is the 

15 number of double bonds. In addition, fatty acids from Table 7 are further defined as 
follows where the number in parentheses corresponds to the position of the double 
bonds from the carboxyl end of the fatty acid: 18:1=18:1(9), 18:2=18:2(9,12), 
GLA=18:3(6,9,12), 18:3=18:3(9,12,15), STA= 18:4(6,9, 12, 15), HGLA=20:3(8,1 1.14). 
ARA=20:4(5.8,11,14), ETA=20:4(8,1 1,14,17). EPA=20:5(5,8,1 1,14.17) and 

20 DPA=22:5(7, 10,13,16,19). Fatty acids listed as "others" include: 20:0, 20:1(5), 

20:2(11,14), 20:3 (5,11,14). 20:3 (11.14.17). 20:4 (5,11,14.17), and 22:0. For KKE2 
events each of these fatty acids is present at relative abundance of less than 1% of 
the total fatty acids. For KR274/275 each of these fatty acids is present at relative 
abundance of less than 1% of total fatty acids except for events 3306-5-2, 3319-6-1, 

25 3319-2-13 in which 20:3 (11,14,17) and 20:4 (5,11.14.17) are both in the range of 
1 .1 to 2.2% of total fatty acids. 
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TABLE 7 

Fatty acid analyses of transgenic soybean somatic embryos producing C20 PUFAs 



Clone ID 


16:0 


18:0 


18:1 


18:2 


GLA 


18:3 


STA 




AS9A 


C f 




f-ID A 


Others 


3306-2-3 


14.9 


2.3 


6.3 


15.8 


21.7 


11.5 


4.5 


4.8 


U.O 


O 7 
<,f 




iJe 




3306-6-2 


14.2 


4.4 


11.7 


19,4 


4.6 


20.8 


1.5 


1.5 


n 4 
U.Z 


•f IS 




4.2 


5.3 


3319-3-1 


18.2 


2.9 


11.0 


19.1 


15.6 


14.5 


3.4 


1.8 


4 Q 

1 .0 


U.O 


0 A 


0.6 


1 .2 


3319-6-1 


11.1 


3.7 


16.6 


12.9 


lU.f 


1 £,.1 


o.o 


ti. n 
o.u 


0.8 


2.8 


9.3 


2.0 




3319-2-13 


12.7 


3.3 


17.5 


14.2 


10.8 


15.9 


3.1 


2.4 


o!i 


2!8 


8!o 


i!i 


3.3 


3319-2-16 


12,7 


2,5 


8.5 


18.1 


10.3 


12.1 


2.3 


3.4 


4.0 


1.0 


7-3 


2.5 


2.3 


3319-3-6 


11.7 


2.0 


10.1 


13.2 


11.5 


7.7 


1.9 


2.8 


0.7 


1.8 


9.3 


1.8 


3.3 


3320-6-1 


15.3 


3.7 


13.5 


10.7 


14.8 


12.4 


4.5 


6.6 


1.4 


2.4 


8.0 


1.2 


2.4 


3322-5-2 


13.9 


2.9 


14.4 


15.6 


17,4 


13.8 


3.5 


2.9 


0.2 


1.8 


8.1 


0.9 


2.2 


3324-1-3 


12.0 


4.4 


18.6 


17.6 


13.9 


7.8 


1.8 


4.8 


0.3 


3.4 


8.1 


0.8 


2.9 


3338-6-3 


14.3 


3.2 


18.1 


11.0 


13.7 


8.8 


3.0 


5.1 


0.2 


5.3 


9.6 


1.2 


2.1 


3338-7-11 


20.5 


2.9 


9.9 


10.6 


8.9 


17.3 


3.8 


2.0 


0.4 


3.0 


12.8 


1.8 


1.9 


3338-7-12 


16.5 


2.1 


15.2 


16.4 


16.1 


11.5 


2.5 


1.7 


0.2 


2.0 


10.0 


0.8 


1.2 


3338-3-4 


20.2 


3.0 


6.7 


11.0 


9.9 


10.5 


3.9 


4.6 


1.8 


3.1 


12.0 


3.2 


2.1 


3338-3-5 


14.7 


2.2 


12.4 


12.4 


17.6 


10.8 


4.7 


2.9 


1.3 


1.4 


10.0 


0.9 


1.8 


3338-6-10 


13.7 


1.8 


12.4 


8.3 


16,4 


14.0 


5.8 


3.2 


0.3 


4.0 


1ZA 


1.2 


2.2 


3338-6-12 


13.9 


2.4 


13.1 


9.4 


22.7 


5.7 


3.1 


4.0 


0.4 


3.3 


13.3 


0.9 


1.5 


3338-7-21 


14,8 


1.7 


8.4 


13.1 


20.2 


12.5 


4.8 


3.9 


0.4 


3.6 


11.6 


0.6 


2 


3338-7-30 


16.4 


2.8 


18.9 


12.9 


9.6 


10.1 


2.4 


2.3 


0.5 


2.3 


13.0 


2.6 


2.4 


3338-1-4 


14.1 


2.1 


10.8 


26.3 


13.8 


9.6 


1.9 


3.3 


1.1 


1.9 


10.1 


1.0 


1.3 


3338-6-24 


25.1 


4.5 


13.3 


4.0 


15.5 


3.1 


2.6 


5.3 


0,7 


4.0 


13.0 


0.9 


1.7 



Mature plants were regenerated from the highest EPA-producing embryos as 
5 described in Example 10, and the fatty acid analyses were performed on chips of 
the seeds from the regenerated plants. The results for six seeds from three plants 
are presented in Table 8. Seeds of control plants possessed fatty aid profiles typical 
of normal soybean, in which linolenic acid (18:3) was the most highly unsaturated 
fatty acid that was detectable. Seeds produced from plants that had a reconstituted 
10 pathway for C20 PUFAs had as much as 25% of their total fatty acid in the form of 
C20 material. Combined levels of EPA and DPA were frequently greater than 15%, 
and were as high as 23,5% of the total. 



TABLE 8 



Event 


16:0 


18:0 


18:1 


18:2 


GLA 


18:3 


STA 


HGLA ARA 


ETA 


EPA 


bPA 


Other EPA+ 






























DPA 


3338-3-4-7 


14.4 


8.5 


19.7 


9.1 


9.1 


3.1 


1.2 


6.6 


1.0 


2.4 


18.8 


4.1 


2.0 


22.9 




13.2 


5.5 


18.6 


10.4 


11.7 


3.3 


1.1 


10.1 


2.2 


2.4 


19.6 


0.8 


1.2 


20.4 




15.6 


9.0 


13.9 


16.6 


6.6 


7.1 


0.0 


3.9 


0.0 


1.8 


15.5 


4.2 


5.8 


19.7 




22.4 


8.8 


20.8 


14.2 


5.0 


3.8 


0.6 


3.0 


1.0 


1.1 


14.0 


3.1 


2.2 


17,1 




13.2 


7.5 


27.0 


12.8 


9.0 


2.8 


0.9 


5.7 


1,.8 


1.2 


11.2 


4.0 


2.9 


15.2 




15.2 


4.9 


18.3 


12.3 


13.3 


3.5 


1.3 


10.5 


5.3 


2.4 


12.9 


0.0 


0.0 


12.9 


3338-7-11-11 


13.0 


7.1 


13.6 


13.1 


13.0 


5.9 


1.7 


5.2 


0.5 


0.4 


16.4 


4.3 


5.8 


20.7 




12.9 


7.3 


13.1 


14.9 


9.6 


7.2 


1.7 


5.9 


0.8 


0.6 


14.3 


4.7 


7.0 


18.9 




12.4 


7.6 


15.9 


12.6 


13.6 


5.4 


1.8 


6.0 


0.5 


0.0 


15.2 


3.7 


5.2 


18.9 




15.0 


5.9 


18.4 


16.0 


10.2 


8.4 


1.7 


4.0 


0.6 


0.0 


13.9 


2.4 


3.5 


16.3 




13.8 


5.9 


19.6 


18.0 


7.2 


10.8 


1.5 


3.4 


0.4 


0.0 


10.8 


3.2 


5.5 


14.0 




16.2 


6.2 


15.2 


22.4 


6.9 


9.2 


1.1 


3.4 


0.8 


0.0 


11.7 


2.2 


4.6 


13.9 
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Event 


16:0 


18:0 


18:1 


18:2 


GLA 


1d:o 


oTA 


HGLA ARA 


ETA 


CD A 
CPA 


UrA 


other EPA+ 






























DPA 


3339-5-3-7 


13.7 


8.1 


6.9 


8.1 


16.5 


4.7 


1.8 


7.1 


0.7 


2.2 


19.5 


4.0 


6.7 


23.6 




15.4 


6.9 


11.8 


16.4 


10.0 


4.3 


0.8 


4.7 


1.2 


1.4 


16.3 


3.5 


7.3 


19.8 




14.7 


6.3 


13.6 


18.1 


8.1 


3.1 


0.9 


4.3 


2.1 


0.1 


14.9 


4.2 


9.6 


19.1 




12.3 


6.5 


20.9 


13.1 


15.1 


3.0 


1.0 


6.1 


1.2 


1.4 


10.6 


1.4 


7.3 


12.1 




12.2 


6.4 


22.9 


13.7 


12.0 


2,9 


0.9 


5.7 


1.3 


1.3 


9.9 


1.7 


9.1 


11.7 




13.5 


7.2 


22.9 


11.8 


8.9 


3.6 


0.8 


6,5 


2.2 


1.7 


9.6 


1,6 


9.8 


11.2 


Control 


17.3 


4.3 


13.4 


61.6 


0.0 


12.9 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 




17.1 


4.8 


12.1 


50.6 


0.0 


14.5 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 



others^ sum of 20:0. 20:1 {d5), 20:1 (d11), 20:2 (d8,11). 20:2 (d11.14). 20:3 (d5,11,14), 
20:3 (d 11,14,17). 20:4 (d5. 11,14.17) each of which is present at less than 2% of TFA 



EXAMPLE 12 

5 Isolation of a Novel Elonqase Gene from the 

Algae Pavlova so. (CCMP459) 
The fatty acid composition of the algae Pavlova sp. (CCMP 459) (Pav459) 
was investigated to determine the polyunsaturated fatty acids (PUFAs) produced by 
this organism. This algae showed a substantial amount of long chain PUFAs 

10 including eicosapentafenoic acid (EPA, 20:5n-3) and docosahexaenoic acid (DHA, 
22:6n-3). Thus, Pav459 was predicted to possess an elongase capable of 
converting EPA to u)3-docosapentaenoic acid (DPA, 22:5n-3), which a delta-4 
desaturase can convert to DHA. The goal was therefore to isolate the predicted 
elongase gene from Pav459, and to verify the functionality of the enzyme by 

15 expression in an alternate host. 

Frozen pellets of Pav45g were obtained from Provasoli-Guillard National 
Center for Culture of Marine Phytoplankton (CCMP, West Boothbay Harbor, ME). 
These pellets were crushed in liquid nitrogen and total RNA was extracted from 
Pav459 by using the Qiagen RNeasy Maxi Kit (Qiagen, Valencia, CA), per 

20 manufacturers instructions. From this total RNA, mRNA was isolated using oligo dT 
cellulose resin, which was then used for the construction of a cDNA library using the 
pSport 1 vector (Invitrogen, Carlsbad, CA). The cDNA thus produced was 
directionally cloned (5'Sa///3'A/of/) into pSportI vector. The Pav459 library contained 
approximately 6.1 x 10^ clones per ml, each with an average insert size of 

25 approximately 1200 bp. Two thousand five hundred primary clones from this library 
were sequenced from the 5' end using the T7 promoter primer (SEQ ID NO:93). 

TAATACGACTCACTATTAGG SEQ ID NO:93 

30 Sequencing was carried out using the ABI BigDye sequencing kit (Applied 

Biosystems, CA) and the MegaBase Capillary DNA sequencer (Amersham 
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biosciences, Piscataway, NJ). Two clones, designated 'pav06-C06' and pav07- 
G01,' which aligned to give a 500 bp sequence containing the 6' end of this novel 
elongase, were obtained from sequencing of the 2,500 library clones. This fragment 
shared 33.3% amino acid sequence identity with the mouse elongase MEL04 and 
5 32.7% amino acid sequence identity with 7. aureu/77 elongase TEL01 (WO 

02/08401). To isolate the full-length gene, the EST clone pav06-C06 was used as a 
template for PGR reaction with 10 pmol of the 5* primer R01327 (SEQ ID NO:94) 
and 10 pmol vector primer R0898 (SEQ ID NO:83). 

10 TGCCCATGATGTTGGCCGCAGGCTATCTTCTAGTG SEQ ID NO:94 

PGR amplification was carried out using Platinum Taq DNA polymerase 
(Invitrogen, Garlsbad, GA) in a 50 |jI total volume containing: 1 |jl of the cDNA clone 
pav06-C06. PGR buffer containing 20 mM Tris-GI, pH 8.4, 50 mM KGI (final 

15 concentration), 200 pM each deoxyribonucleotide triphosphate, 10 pmole of each 
primer, 1.5 mM MgS04, and 0.5 |jl of Platinum Taq (HF) DNA polymerase. 
Amplification was carried out as follows using the Perkin Elmer 9700 machine: initial 
denaturation at 94*=*C for 3 minute, followed by 35 cycles of the following: 94**'G for 
' 45 sec, 55''G for 30 sec, 68''G for 2 min. The reaction was terminated at 4°G. The 

20 PGR amplified mixture was run on a gel, an amplified fragment of approximately 
1.3 Kb was gel purified, and the isolated fragment was cloned into the pGR-blunt 
vector (Invitrogen, Carisbad, CA). The recombinant plasmid was transformed into 
TOP10 supercompetent cells (Invitrogen, Garlsbad, GA). and prepared. The 
prepared recombinant plasmid was digested with EcoRl, run on a gel, and the 

25 digested fragment of approximately 1.2 Kb was gel purified, and cloned into pYX242 
(EcoRI) vector (Novagen, Madison, Wl). The new plasmid was designated as 
pRPL-6-1. 

The plasmid pRPL-6-1 was prepared and sequenced using ABI 373A Stretch 
DNA Sequencer (Perkin Elmer, Foster Gity, GA). The translated amino acid 
30 sequence of the cDNA in pRPL-6-1 had 33.7% identity In 261 amino acids with 
MEL04, 33.8% Identity in 240 amino acids with GLELO. 28.1% identity in 
274 amino acids with HSEL01 , and 32.5% identity in 246 amino acids with TEL01 
(WO 02/08401). 

The construct pRPL-6-1 was transformed into S. cerevisiae 334 (Hoveland et 
35 al. (1989) Gene 83:57-64) and screened for elongase activity. S. cerevisiae 334 
containing the unaltered pYX242 vector was used as a negative control. The 
cultures were grown for 44 hours at 24*'G, in selective media (Ausubei et al., (1992) 
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Short Protocols in Molecular Biology . Ch. 13, p. 3-5), in the presence of 25 pM of 
GLA or EPA. in this study, DGI_A or a)3-docosapentaenolc acid (DPA, 22:5n-3), 
respectively, was the predicted product of the elongase activity. The lipid profiles of 
these yeast cultures Indicated that v»/hile no conversion of GLA to DGLA was seen, 
EPA was elongated to DPA at a very low level (DPA was 0.34% of total fatty acids, 
while EPA was 32.28% of total fatty acids). This indicated that the expressed 
enzyme in this culture preferred the elongation of 20 carbon chain long PUFA, and 
not the 18 carbon chain long PUFA, GLA. It also indicated that a mutation might be 
present in the DNA sequence, which is inhibiting the full activity of the expressed 
enzyme. 

To isolate the full-length gene without mutations, RACE (rapid amplification of 
cDNA ends) ready cDNA was used as a target for the reaction. To prepare this 
material, approximately 5 pg of total RNA was used according to the manufacturer's 
direction with the GeneRacer™ kit (Invitrogen, Carlsbad, CA) and Superscript II™ 
enzyme (Invitrogen, Carlsbad, CA) for reverse transcription to produce cDNA target. 
This cDNA was then used as a template for a PCR reaction with 50 pmols of the 6' 
primer R01327 and 30 pmol GeneRacer™ 3' jarimer (SEQ ID NO:95). 

GCTGTCAACGATACGCTACGTAACG SEQ ID NO:95 

PCR amplification was carried out using Platinum Taq DNA polymerase 
(Invitrogen, Carisbad, CA) in a 50 pi total volume containing: 2 pi of the RACE 
ready cDNA, PCR buffer containing 20 mM Tris-Cl, pH 8.4, 50 mM KCI (final 
concentration), 200 pM each deoxyribonucleotide triphosphate, 10 pmoie of each 
primer, 1.6 mM MgS04, and 0.5 pi of Platinum Taq (HF) DNA polymerase. 
Amplification was carried out as follows using the Peri<in Elmer 9600 machine: initial 
denaturation at 94°C for 3 minute, followed by 35 cycles of the following: 94"C for 
45 sec, SS^C for 30 sec, 68°C for 2 min. The reaction was temninated at 4''C. 

The PCR amplified mixture was run on a gel. an amplified fragment of 
approximately 1 .2 Kb was gel purified, and the isolatdd fragment was cloned Into the 
PCR-blunt vector (Invitrogen, Carisbad, CA). The recombinant plasmids were 
transformed into TOPI 0 supercompetent cells (Invitrogen, Carisbad, CA), and 
prepared. The prepared recombinant plasmid was digested with EcoRi, run on a 
gel, and the digested fragment of approximately 1.2 Kb was gel purified, and cloned 
into pYX242 (EcoRl) vector (Novagen, Madison, Wl). The new plasmids were 
designated as pRPL-6-B2 and pRPL-6-A3. 

The plasmids pRPL-6-B2 and pRPL-6-A3 were prepared and sequenced 
using ABI 373A Stretch DNA Sequencer (Perkin Elmer, Foster City, CA). The 
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translated amino acid jsequence of the cDNA in pRPL-6-B2 had 34.1% identity In 
261 amino acids with MEL04, 33.8% identity in 240 amino acids with GLELO, 
28.5% identity in 274 amino acids with HSEL01, and 32.5% identity in 246 amino 
acids with TEL01. (Plasmid pRPL-6-B2 was deposited with the American Type 
5 Culture Collection. 1 0801 Manassas, VA 201 1 0-2209 under the terms of the 
Budapest Treaty and was accorded accession number PTA-4350.) 

The constructs pRPL-6-B2 and pRPL-6-A3 were transformed into 
S. cerevisiae 334 (Hoveland et a!., supra) and screened for elongase activity. 
S. cerevisiae 334 containing the unaltered pYX242 vector was used as a negative 

10 control. The cultures were grown for 44 hours at 24°C, in selective media (Ausubel 
et al., supra), in the presence of 25 pM of GLA or EPA. In this study, DGLA or 
u)3-docosapentaenoic acid (DPA, 22:5n-3), respectively, was the predicted product 
of the elongase activity. The lipid profiles of these yeast cultures indicated that GLA 
was not elongated to DGLA in any of the samples (data not shown). The cultures of 

15 334(pRPL-6-B2) and 334(pRAT-6-A3) had significant levels of conversion of the 
substrate EPA to DPA, indicating that the expressed enzymes in these cultures 
preferred the elongation of 20-carbon chain long PUFA, and not the 18-chain long 
PUFA, GLA. 

The amino acid sequences of the 3 clones were compared to determitie if the 

20 substrate conversion levels were dictated by the translated sequences. The cDNA 
sequence of pRPL-6-1 is different from pRPL-6-B2 at A512G. This single mutation 
substantially reduced the conversion of the C20 substrate fatty acid to its elongated 
product. It appears that this is an important region of the enzyme for 20-carbon 
chain elongation. The cDNA sequence of pRPL-6-A3 is different from pRPL-6-B2 at 

25 D169N and C745R. These mutations reduced the conversion of the 020 substrate 
fatty acid to its elongated product, but the expressed enzyme was able to maintain 
"some activifyT TITe^eloT^ase gehe1h'pRPL-«-B2rh~as*the"s'eqa seffdrth in SEQ 
ID NO:49 and the amino acid sequence set forth in SEQ ID NO:50. 

To further confirm the substrate specificity of the algal elongation enzyme, 

30 described above and referred to herein as PELOIp, the recombinant yeast strain 
334(pRPL-6-B2) was grown in minimal media containing n-6 fatty acids l-A. GLA, 
DGLA, AA, or n-3 fatty acids ALA, STA, ETA. EPA, or 20:0, or 20:1. The lipid 
profiles of these yeast cultures, when examined by GC and GC-MS, indicated that 
there were accumulations of adrenic acid (ADA, 22:4-6) and EPA, respectively. The 

35 levels of these fatty acids were 1 .40% ADA and 2.54% EPA, respectively, of the 
total fatty acids in the strains containing the PEL01 sequence. These represented 
14.0% and 14.1% conversions of the substrate fatty acids, respectively, to the 
products elongated by two carbon atoms. No elongation of the saturated fatty acid 
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20:0, or monounsaturgted fatty acid 20:1 was seen. Also, no elongation of the C18 
substrates LA, GLA, ALA, or STA was seen. These results Indicated that the 
expressed enzyme activity in strain 334(pRPL-6-B2) was specific for the elongation 
of 20-carbon chain long PUFAs. and not the 18-chaln long PUFA. or the 20-carbon 
chain long saturated or monounsaturated fatty acids. 

FXAMPLE13 

Assembling HHA BiosvnthfitiR Pathway H ^nes for Expression in Somatic Soybean 
Fmhtvns ^dKR365 nKR364. and pKR357) 

Construction of plasmid pKR365 

The S. diclina delta-6 desaturase, M. alpina delta-5 desaturase and S. diclina 
delta-17 desaturase were cloned into plasmid pKR365 behind strong, seed-specific 
promoters allowing for high expression of these genes in somatic soybean embryos 
and soybean seeds. The deltas desaturase was cloned behind the KTi promoter 
followed by the KTi 3* tennination region (Kti/Sdd6/Kti3' cassette). The delta-5 
desaturase was cloned behind the GlycininGyl promoter followed by the pea 
leguminA2 3' tennination region (Gy1/Mad5/legA2 cassette). The S. diclina delta-17 
desaturase was cloned behind the soybean Annexin promoter followed by the soy 
BD30 3' tennination region (Ann/Sdd17/BD30 cassette). Plasmid pKR365 also 
contains the T7prom/HPT/T7temi cassette for bacterial selection of the plasmid on 
hygromycin B and a bacterial origin of replication (on) from the vector pSP72 
(Stratagene). 

Plasmid pKR365 was constructed from a number of different Intennediate 
cloning vectors as follows: The Gy1/Mad5/legA2 cassette was released from 
plasmid pKR287 by digestion with Sbf\ and Bs/WI. This cassette was cloned into 
the Sbn/BslSN\ site of plasmid pKR369. containing the Kti/Sdd6/Ktl3' cassette, the 
T7prom/hptrr7temi cassette and the bacterial ori to give pKR362. The 
Ann/Sdd17/BD30 cassette, released from pKR271 (described in Example 7) by 
digestion with Pst\, was then cloned into the Sbn site of pKR362 to give pKR365. . 
A schematic representation of pKR365 is shown in Figure 6. A detailed description 
for plasmid construction for pKR287 and pKR359 is provided below. 

Plasmid pKR287 was constructed by digesting pKR136 (described in 
Example 4) with WofI, to release the M. alpina deita-5 desaturase. and cloning this 
fragment into the Not\ site of pKR263 (described in Example 4). 

Plasmid pKR359 was constructed by cloning the NoH fragment of pKR295, 
containing the delta-6 desaturase, into the A/ofl site of the Kti//Vofl/Kti3' cassette in 
pKR353. Vector pKR353 was constructed by cloning the H/ndlll fragment, 
containing the Kti/A/ofl/Kti3' cassette, from pKR124 (described In Example 2) into 
the H/ndlll site of pKR277. Plasmid pKR277 was constructed by digesting pKR197 
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(described in Example 4) with HindlW to remove the Bcon/A/ofl/phas3' cassette. To 
construct pKR295, the gene for the S. diclina delta-6 desaturase was removed from 
pRSP1 (Table 1) by digestion with EcoRI and EcoRV and cloned into the 
MfeMEcoRy site of pKR288. Vector pKR288 was an intermediate cloning vector 
5 containing a DNA stuffer fragment flanked by Not\/Mfel sites at the 5' end and 
EcoRy/Not\ sites at the 3' end of the fragment. The DNA stuffer fragment was 
amplified with Vent polymerase (NEB) from plasmid CalFad2-2 (described In 
WO 01/12800) using primer oCal-26 (SEQ ID NO:96), designed to Introduce an Mfe\ 
site at the 5' end of the fragment, and oCal-27 (SEQ ID NO:97), designed to 
1 0 introduce an EcoRV site at the 3' end of the fragment. 

GCCAATTGGAGCGAGTTCCAATCTC (SEQ ID NO:96) 

GCGATATCCGTTTCTTCTGACCTTCATC (SEQ ID NO:97). 

The primers also introduced partial Not\ sites at both ends of the fragment such that 
subsequent cloning into a filled A/of I site added A/ofl sites to the end. 

Construction of olasmid dKR364 

20 The M.alpina delta-6 desaturase, M. alpine delta-5 desaturase and S. diclina 

delta-17 desaturase were cloned into plasmid pKR364 behind strong, seed-specific 
promoters allowing for high expression of these genes in somatic soybean embryos 
and soybean seeds. Plasmid pKR364 is identical to pKR365 except that the Not\ 
fragment that contains the S. diclina delta-6 desaturase in pKR365 was replaced 

25 with the Not\ fragment containing the M. alpina delta-6 desaturase as found In 
pKR274. A schematic representation of pKR364 Is shown in Figure 7. 

Construction of plasmid DKR357 

The S. aggregatum delta-4 desaturase, M. alpina elongase and Pavlova 

30 elongase (Tablel) were cloned into plasmid pKR357| behind strong, seed-specific 
promoters allowing for high expression of these genes in somatic soybean embryos 
and soybean seeds. The delta-4 desaturase (SEQ ID NO:51. and Its protein 
translation product shown In SEQ ID NO:52) was cloned behind the KTI promoter 
followed by the KTI 3' termination region (Kti/Sad4/Kti3' cassette). The Pavlova 

35 elongase (SEQ ID NO:49) was cloned behind the GlycininGyl promoter followed by 
the pea leguminA2 3' termination region (Gy1/Pavelo/legA2 cassette). The M. 
alpina elongase was cloned behind the promoter for the a'-subunit of p-conglycinin 
followed by the 3' transcription termination region of the phaseolin gene 
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(Pcon/Maelo/Phas3' cassette). Piasmid pKR357 also contains the 
T7prom/HPT/T7term cassette for bacterial selection of the piasmid on hygromycin 
B. a 35S/hpt/NOS3' cassette for selection in soy and a bacterial origin of replication 
(on). 

5 Piasmid pKR357 was constructed from a number of different intermediate 

cloning vectors as follows: The Gy1/Pavelo/legA2 cassette was released from 
piasmid pKR336 by digestion with Pst\ and Bs/WI. The Gy1/Pavelo/legA2 cassette 
was then cloned into the Sbf\/Bsi\N\ site of piasmid pKR324, containing the 
pcon/Maelo/Phas3' cassette, the T7prom/hpt/T7term cassette, the 35S/hpt/Nos3' 

10 cassette and the bacterial onto give pKR342. The KTi/Sad4/KTi3* cassette, 

released from pKR348 by digestion with Psfl, was then cloned into the Sbft site of 
pKR342 to give pKR357. A schematic representation of pKR357 is shown in 
Figure 8. A detailed description for piasmid construction for pKR336, pKR324 and 
pKR348 is provided below. 

15 Piasmid pKR336 was constructed by digesting pKR335 with A/ofI, to release 

the Pavlova elongase, and cloning this fragment into the Nott site of pKR263 
(described in Example 4), which contained the Gy1/Wof//legA2 cassette. To 
construct pKR335, pRPL-6-B2 (described in Table 1) was digested with Psfl« and the 
3' overhang removed by treatment v/ith VENT polymerase (NEB). The piasmid was 

20 then digested with EcoRI to fully release the Pavlova elongase as an EcoRl/Pst\ 
blunt fragment. Thi^ fragment was cloned into the MfeUEcoRy site of intermediate 
cloning vector pKR333 to give pKR335. Vector pKR333 was identical to pKR288 
(Example 3 and 13) in that it contained the same A4fel and HcoRV sites falnked by 
A/ofI sites and was generated in a similar way as pKR288. 

25 Piasmid pKR324 was constructed by cloning the A/of I fragment of pKS134 

(described in Example 3), containing the M. alpina elongase, into the Not\ site of the 
pcon/A/ofl/Phas3' cassette of vector pKR72 (described in Example 4). 

Piasmid pKR348 was constructed by cloning the Not\ fragment of pKRSOO, 
containing the S. aggregatum delta-4 desaturase, into the A/ofl site of the 

30 KTi/A/ofl/KTi3' cassette in pKR123R. To construct pKR300, the gene for the delta-4 
desaturase was removed from pRSA1 (Table 1) by digestion with EcoRI and EcoRV 
and cloned into the Mfel/EcoRV site of pKR288 (described in Example 3 and 13). 
Piasmid pKR123R contains a Nof\ site flanked by the KTi promoter and the KTI 
transcription temiination region (KTi/A/ofl/KTi3' cassette). In addition, the 

35 KTi/A/ofl/KTi3' cassette was flanked by Psfl sites. The KTi/A/ofl/KTi3' cassette was 
amplified from pKS126 (described in Example 2) using primers oKTi5 (SEQ ID 
NO:23) and oKTi7 (SEQ ID NO:98) designed to Introduce an Xba\ and Bs/WI site at 
the 5' end. and a Psfl/S/?fl and Xba\ site at the 3' end, of the cassette. 
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TTCTAGACCTGCAGGATATAATGAGCCG (SEQ ID NO:98) 

The resulting PGR fragment was subcloned into the Xbal site of the cloning 
5 vector pUC19 to give plasmid pKR123R with the KTi/A/ofl/ICri3' cassette flanlced by 
Psfl sites. 

Production of DHA in somatic embryos 

Plasmlds pKR357, pKR365 and pKR364 were prepared as described in 
Example 9. Fragments of pKR365 and pKR364 were also obtained and purified as 

10 described for pKR274, pKR275 and pKKE2 in Example 9. Plasmlds pKR357and 
either pKR365 or pKR364 were cotransformed into soybean embryogenic 
suspension cultures (cv. Jack) as described in Example 9. Hygromycin-resistant 
embryos containing pKR365 and pKR357, or pKR364 and pKR357 were selected 
and clonally propagated also as described in Example 9. Embryos were matured by 

15 culture for 4-6 weeks at 26^C in SB196 under cool white fluorescent (Phillips cool 
white Econowatt F40/CW/RS/EW) and Agro (Phillips F40 Agro) bulbs (40 watt) on a 
16:8 hr photoperlod with light intensity of 90-120 |jE/m2s. After this time embryo 
clusters were removed to a solid agar media, SB166, for 1-2 weeks. Clusters were 
then subcultured to medium SB103 for 3 weeks. During this period, individual 

2t) embryos were removed from the clusters and screened for alterations in their fatty 
acid compositions as follows. 

Fatty acid methyl esters were prepared from single, matured, somatic soy 
embryos by transesterification as described in Example 10. Retention times were 
compared to those for methyl esters of standards commercially available (Nu-Chek 

25 Prep, Inc. catalog #U-99-A). Six embryos from each event were analyzed in this 
way; Fatty acid methyl esters from embryos transformed with pKR357 and pKR366 
containing the highest levels of DHA are shown in-Table 9. 
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TABLE 9 

Fatty acid analysis of somatic embryos containing DHA pathway genes 

(pKR357 and pKR365) 

Event '16:0 '18:0 '18:1 '18:2 GLA '18:3 '18:4 

1114-6-5-1 10.8 9.4 2.3 28.8 0 19.7 2 

1114-6-5-7 13.8 8 6.4 30.1 2.1 15 2 

1116-8-16-1 13.8 7 6.2 27.3 4 10.5 0.9 

20:2(11,14) 20:3(8,11,14) ARA 20:3 (11.14,17) 20:4(5,11,14,17) EPA DHA 

1114-6-5-1 6.2 3.2 1.4 4.2 1.7 2.5 1.3 

1114-6-5-7 3.7 4.3 2.9 1.9 1.6 4.1 1.6 

1116-8-16-1 4.6 3.9 5.2 2.3 1.1 6.1 3.1 

In addition to those fatty acids shown. 20:0, 20:1, 20:3 (5,1 1 .14). DPA and 
ETA are also present in the extracts, each less than 1% of total fatty acids. 

DHA is defined as 22:6(4.7.10.13.16,19) by the nomenclature described in 
Example 1 1 . 

Fatty acid methyl esters for embryos transformed with pKR357 and pKR364 
containing the higest levels of DHA are shown in Table 10. 
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CLAIMS 

What is claimed is: 

1 . An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 1.0 % of at least one polyunsaturated fatly acid 

5 having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

2. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 5.0 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

3. An oilseed plant that produces mature seeds in which the total seed fatty 
10 acid profile comprises at least 10.0 % of at least one polyunsaturated fatty acid 

having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

4. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 15.0 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

15 5. An oilseed plant that produces mature seeds in which the total seed fatty 

acid profile comprises at least 20.0 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

6. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 25.0 % of at least one polyunsaturated fatty acid 

20 having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

7. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 30.0 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

8. An oilseed plant that produces mature seeds In which the total seed fatty 
25 acid profile comprises at least 40.0 % of at least one polyunsaturated fatty acid 

having at least-twenty carbon atoms and five-Gr-more-capbon-carbon double bonds. 

9. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises at least 50.0 % of at least one polyunsaturated fatty acid 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

30 10. An oilseed plant that produces mature seeds in which the total seed fatty 

acid profile comprises at least 60.0 % of at least one polyunsaturated fatty add 
having at least twenty carbon atoms and five or more carbon-carbon double bonds. 

1 1 . The oilseed plant of any of Claims 1-10 wherein the polyunsaturated fatty 
acid is an omega-3 fatty acid. 

36 12. The oilseed plant of any of Claims 1-10 wherein the polyunsaturated fatty 

acid is an omega-3 fatty acid selected from the group consisting of EPA, DPA, and 
DHA. 
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1 3. The oilseecj plant of any of Claims 3-1 0 wherein the total seed fatty acid 
profile further comprises less than 2.0% arachidonic acid. 

14. The oilseed plant of any of Claims 3-10 wherein the polyunsaturated fatty 
acid is an omega-3 fatty acid and the total seed fatty acid profile further comprises 

5 less than 2.0% arachidonic acid. 

15. The oilseed plant of any of Claims 3-10 wherein the polyunsaturated fatly 
acid is an omega-3 fatty acid selected from the group consisting of EPA, DPA, and 
DHA, and the total seed fatty acid profile further comprises less than 2.0% 
arachidonic acid. 

10 16. Seeds obtained from the plant of any of Claims 1-10. 

17. Seeds obtained from the plant of any of Claims 1-10 wherein the 
polyunsaturated fatty acid is an omega-3 fatty acid. 

18. Seeds obtained from the plant of any of Claims 1-10 wherein the 
polyunsaturated fatty acid is an omega-3 fatty acid selected from the group 

15 consisting of EPA, DPA, and DHA. 

1 9. Seeds obtained from the plant of any of Claims 3-1 0 wherein the 
polyunsaturated fatty acid is an omega-3 fatty acid and the total seed fatty acid 
profile further comprises less than 2.0% arachidonic acid. 

20. Seeds obtained from the plant of any of Claims 3-1 0 wherein the 
20 polyunsaturated fatty acid is an omega-3 fatty acid selected from the group 

consisting of EPA, DPA, and DHA, and the total seed fatty acid profile further 
comprises less than 2.0% arachidonic acid. 

21. Oil obtained from the seeds of the plants of any of Claims 1-10. 

22. Oil obtained from the seeds of the plants of any of Claims 1-10 wherein 
25 the polyunsaturated fatty acid an omega-3 fatty acid. 

23. Oil obtained from the seeds of the plants of any of Claims 1-10 wherein 
the polyunsaturated fatty acid is an omega-3 fatty acid selected from the group 
consisting of EPA. DPA, and DHA. 

24. Oil obtained from the seeds of the plants of any of Claims 3-1 0 wherein 
30 the polyunsaturated fatty acid is an omega-3 fatty acid and the total seed fatty acid 

profile further comprises less than 2.0% arachidonic acid. 

25. Oil obtained from the seeds of the plants of any of Claims 3-1 0 wherein 
the polyunsaturated fatty acid is an omega-3 fatty acid selected from the group 
consisting of EPA, DPA, and DHA, and the total seed fatty acid profile further 

35 comprises less than 2.0% arachidonic acid. 

26. The plant of any of Claims 1-10 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
and safflower. 
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27. The plant of any of Claims 1-10 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty 
acid. 

28. The plant of any of Claims 1-10 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatly 
acid selected from the group consisting of EPA, DPA, and DHA. 

29. The plant of any of Claims 3-10 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty 
acid and the total seed fatty acid profile further comprises less than 2,0% 
arachidonic acid. 

30. The plant of any of Claims 3-10 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
and safflower, and further wherein the polyunsaturated fatty acid is an omega-3 fatty 
acid selected from the group consisting of EPA, DPA, and DHA, and the total seed 
fatty acid profile further comprises less than 2.0% arachidonic acid. 

31 . A recombinant construct for altering the total fatty acid profile of mature 
seeds of an oilseed plant, said construct comprising at least two promoters wherein 
each promoter is ojjerably linked to a nucleic acid sequence encoding a polypeptide 
required for making at least one polyunsaturated fatty acid having at least twenty 
carbon atoms and four or more carbon-carbon double bonds and further wherein the 
total fatty acid profile comprises at least 2% of at least one polyunsaturated fatty 
acid having at least twenty carbon atoms and four or more carbon-carbon double 
bonds-and-further wherein said polypeptide is an enzyme selected from the group 
consisting of a A 4 desaturase, a A 5 desaturase, A 6 desaturase, a A 15 
desaturase, a A 17 desaturase, a C18 to C22 elongase and a C20 to C24 elongase. 

32. The recombinant construct of Claim 31 wherein the promoter is selected 
from the group consisting of the alpha prime subunit of beta conglycinin promoter, 
Kunitz trypsin inhibitor 3 promoter, annexin promoter, Gly1 promoter, beta subunit of 
beta conglycinin promoter, P34/Gly Bd m 30K promoter, albumin promoter, Leg A1 
promoter and Leg A2 promoter. 

33. An oilseed plant comprising in its genome the recombinant construct of 
Claim 31. 

34. An oilseed plant comprising in its genome the recombinant construct of 
Claim 32. 
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35 The oilseed plant of Claim 33 wherein the oilseed plant is selected from 
the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 
safflower. 

36. The oilseed plant of Claim 34 wherein the oilseed plant is selected from 
5 the group consisting of soybean, Brassica species, sunflower, maize, cotton, flax, 

safflower. 

37. Seeds obtained from the plant of Claim 33. 

38. Seeds obtained from the plant of Claim 34. 

39. Seeds obtained from the plant of Claim 35. 
1 0 40. Seeds obtained from the plant of Claim 36. 

41 . Oil obtained from the seeds of Claim 37. 

42. Oil obtained from the seeds of Claim 38. 

43. Oil obtained from the seeds of Claim 49, 

44. Oil obtained from the seeds of Claim 40. 

15 45. A method for making an oilseed plant having an altered fatty acid profile 

which comprises: 

a) transforming a plant with the recombinant construct of Claim 31 ; 

b) growing the transformed plant of step (a); and 

c) selecting those plants wherein the total fatty acid profile comprises 
20 at least 1 .0 % of at least one polyunsaturated fatty acid having at 

least twenty carbon atoms and five or more carbon-carbon double 
bonds. 

46. An oilseed plant made by the method of Claim 30. 

47. Seeds obtained from the plant of Claim 31 . 
25 48. Oil obtained from the seeds of Claim 32. 

49. A method for making an oilseed plant having an altered fatty acid profile 
which comprises: 

a) transforming a plant with the recombinant construct of Claim 32. 

b) growing the transformed plant of step (a); and 

30 c) selecting those plants wherein the total fatty acid profile comprises 

at least 1.0 % of at least one polyunsaturated fatty acid having at 
least twenty carbon atoms and five or more carbon-carbon double 
bonds. 

50. An oilseed plant made by the method of Claim 34. 
35 51 . Seeds obtained from the plant of Claim 35. 

52. Oil obtained from the seeds of Claim 36. 

53. A food product or food analog which has incorporated therein the oil of 
Claim 21. 
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64. A food proc|uGt or food analog which has incorporated therein the oil of 
Claim 41. 

55. A food product or food analog which has incorporated therein the oil of 
Claim 42. 

66. A food product or food analogwhich has incorporated therein the oil of 
Claim 43. 

57. A food product or food analog which has incorporated therein the oil of 
Claim 44. 

58. A food product or food analog which has incorporated therein the oil of 
Claim 48. 

59. A food product or food analog which has incorporated therein the oil of 
Claim 52. 

60. The food product of Claim 53 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 
animal feed or aquaciilture feed. 

61 . The food product of Claim 54 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a daify food, meat analogs, cheese analogs, milk analogs, a pet food, 
animal feed or aquaculture feed. 

62. The food product of Claim 55 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 
animal feed or aquaculture feed. 

63. The food product of Claim 56 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good,' an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 
animal feed or aquaculture feed. 

64. The food product of Claim 57 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 
animal feed or aquaculture feed. 
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65. The food product of Claim 58 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 

5 animal feed or aquaculture feed. 

66. The food product of Claim 59 wherein said product is selected from the 
group consisting of a spray-dried food particle, a freeze-dried food particle, meat 
products, a cereal food, a snack food, a baked good, an extruded food, a fried food, 
a health food, a dairy food, meat analogs, cheese analogs, milk analogs, a pet food, 

10 animal feed or aquaculture feed. 

67. A beverage which has incorporated therein the oil of Claim 21 . 

68. A beverage which has incorporated therein the oil of Claim 41 . 

69. A beverage which has incorporated therein the oil of Claim 42. 

70. A beverage which has incorporated therein the oil of Claim 43. 
15 71 . A bevierage which has incorporated therein the oil of Claim 44. 

72. A beverage which has incorporated therein the oil of Claim 48. 

73. A beverage which has incorporated therein the oil of Claim 52. 

74. Infant formula which has incorporated therein the oil of Claim 21. 

75. Infant formula which has incorporated therein the oil of Claim 41, 
20 76. Infant formula which has incorporated therein the oil of Claim 42. 

77. Infant formula which has incorporated therein the oil of Claim 43. 

78. Infant formula which has incorporated therein the oil of Claim 44. 

79. Infant formula which has incorporated therein the oil of Claim 48. 

80. Infant formula which has incorporated therein the oil of Claim 52. 
25 81. A nutritional supplement which has incorporated therein the oil of 

Claim 21. 

82. A nutritional supplement which has incorporated therein the oil of 
Claim 41. 

83. A nutritional supplement which has incorporated therein the oil of 
30 Claim 42. j 

84. A nutritional supplement which has incorporated therein the oil of 
Claim 43. 

85. A nutritional supplement which has incorporated therein the oil of 
Claim 44. 

35 86. A nutritional supplement which has incorporated therein the oil of 

Claim 48. 

87. A nutritional supplement which has incorporated therein the oil of 



Claim 52. 
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88. A food product or food analog which has incorporated therein the seed of 
Claim 13. 

89. A food product or food analog which has incorporated therein the seed of 
Claim 37. 

5 90. A food product or food analog which has incorporated therein the seed of 

Claim 38. 

91 . A food product or food analog which has incorporated therein the seed of 
Claim 39. 

92. A food product or food analog which has incorporated therein the seed of 
10 Claim 40. 

93. A pet food which has incorporated therein the seed of Claim 16. 

94. A pet food which has incorporated therein the seed of Claim 37. 

95. A pet food which has incorporated therein the seed of Claim 38. 

96. A pet food which has incorporated therein the seed of Claim 39. 
15 97. A pet food which has incorporated therein the seed of Claim 40. 

98. Animal feed which has incorporated therein the seed of Claim 16. 

99. Animal feed which has incorporated therein the seed of Claim 37, 

100. Animal feed which has incorporated therein the seed of Claim 38. - 

101 . Animal feed which has incorporated therein the seed of Claim 39. 
2'0 102. Animal feed which has incorporated therein the seed of Claim 40. 

103. A pet food which has incorporated therein the oil of Claim 21 , 

104. A pet food which has incorporated therein the oil of Claim 41 . 

105. A pet food which has incorporated therein the oil of Claim 42, 

1 06. A pet food which has incorporated therein the oil of Claim 43, 
25 107. A pet food which has incorporated therein the oil of Claim 44, 

1087" A^"etfo'o*d*which"has incorporated therein the oH-ofCIaim 48, 

109. A pet food which has incorporated therein the oil of Claim 52, 

110. Animal feed which has incorporated therein the oil of Claim 21 . 

111. Animal feed which has incorporated therein the oil of Claim 41 . 
30 112. Animal feed which has incorporated therein the oil of Claim 42. 

113. Animal feed which has incorporated therein the oil of Claim 43. 

114. Animal feed which has incorporated therein the oil of Claim 44. 

115. Animal feed which has incorporated therein the oil of Claim 48. 

116. Animal feed which has incorporated therein the oil of Claim 62. 
35 1 17. A whole bean soy product made from the seed of Claim 16. 

118. A whole bean soy product made from the seed of Claim 37. 

119. A whole bean soy product made from the seed of Claim 38. 

120. A whole bean soy product made from the seed of Claim 39. 
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121 . A whole begn soy product made from the seed of Claim 40. 

122. An aquaculture food product which has incorporated therein the oil of 
Claim 21. 

123. An aquaculture food product which has incorporated therein the oil of 
5 Claim 41. 

124. An aquaculture food product which has incorporated therein the oil of 
Claim 42. 

125. An aquaculture food product which has incorporated therein the oil of 
Claim 43. 

10 126. An aquaculture food product which has incorporated therein the oil of 

Claim 44. 

127. An aquaculture food product which has incorporated therein the oil of 
Claim 48. 

128. An aquaculture food product which has incorporated therein the oil of 
15 Claim 52. 

129. An aquaculture food product which has incorporated therein the seed of 
Claim 16. 

130. An aquaculture food product which has incorporated therein the seed of 
Claim 37- 

20 131 . An aquaculture food product which has incorporated therein the seed of 

Claim 38. 

132. An aquaculture food product which has incorporated therein the seed of 
Claim 39. 

133. An aquaculture food product which has incorporated therein the seed of 
25 Claim 40. 

134. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises polyunsaturated fatty acids having at least twenty carbon 
atoms and five or more carbon-carbon double bonds wherein the ratio of EPAiDHA 
is in the range from 1:100 to 860:100. 

30 135. An oilseed plant that produces mature seeds in which the total seed fatty 

acid profile comprises polyunsaturated fatty acids having at least twenty carbon 
atoms and five or more carbon-carbon double bonds wherein the ratio of EPA:DHA 
is in the range from 1 :100 to 860:100 and further wherein the total seed fatty acid 
profile further comprises less than 2.0% arachidonic acid. 

35 136. An oilseed plant that produces mature seeds in which the total seed fatty 

acid profile comprises polyunsaturated fatty acids having at least twenty carbon 
atoms and five or more carbon-carbon double bonds wherein the ratio of DHA:EPA 
is in the range from 1:100 to 110:100. 
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137. An oilseed plant that produces mature seeds in which the total seed fatty 
acid profile comprises polyunsaturated fatty acids having at least twenty carbon 
atoms and five or more carbon-carbon double bonds wherein the ratio of DHArEPA 
Is in the range from 1 : 1 00 to 1 1 0: 1 00 and further wherein the total seed fatty acid 
profile further comprises less than 2.0% arachidonic acid. 

1 38. Seeds obtained from the plant of any of Claims 1 34-1 37. 

139. Oil obtained from the seeds of the plants of any of Claims 134-137. 
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FIGURE 2 
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<110> E.I. du Pont de Nemours and Company 

<120> Production of Long Chain Polyunsaturated Fatty Acids in Plants 

<130> BB1538 PCT 

<140> 
<141> 

<150> OS 60/446,941 

<151> 2003-02-12 

<160> 98 

<170> Patentin version 3.2 



<210> 1 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> isynthetic oligonucleotide 

<400> 1 

gccccccatc ctttgaaagc ctgt 

<210> 
<211> 
<212> 
<213> 

<220> 
<223> 

<400> 2 

cgcggatccg agagcctcag catcttgagc agaa 



24 



34 



2 

34 
DNA 

Artificial Sequence 



synthetic oligonucleotide 



<210> 3 

<211> 2012 

<212> DNA 

<213> Glycine max 

<400> 3 

atcttaggcc cttgattata tggtgtttag atggattcac atgcaagttt ttatttcaat 60 

cccttttcct ttgaataact gaccaagaac aacaagaaaa aaaaaaaaag aaaaggatca 120 

ttttgaaagg atatttttcg ctcctattca aatactgtat ttttaccaaa aaaactgtat 18 0 

ttttcctaca ctctcaagct ttgtttttcg cttcgactct catgatttcc ttcatatgcc 24 0 

aatcactcta tttataaatg gcataaggta gtgtgaacaa ttgcaaagct tgtcatcaaa 300 

^gcttgcaat gtacaaatta atgtttttca tgcctttcaa aattatctgc accccctagc 360 

tattaatcta acatctaagt aaggctagtg aattttttcg aatagtcatg cagtgcatta 420 

atttccccgt gactattttg gctttgactc caacactggc- cccgtacatc cgtccctcat 480 

tacatgaaaa gaaatattgt ttatattctt aattaaaaat attgtccctt ctaaattttc 540 

atatagttaa ttattatatt acttttttct ctattctatt agttctattt tcaaattatt 600 

atttatgcat atgtaaagta cattatattt ttgctatata cttaaatatt tctaaattat 660 

taaaaaaaga ctgatatgaa aaatttattc tttttaaagc tatatcattt tatatatact 7 20 

ttttcttttc ttttctttca ttttctattc aatttaataa gaaataaatt ttgtaaattt 780 
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ttatttatca 
aatcatatca 
aaatgtgi:'ba 
tctctcattt 
taattatttt 
acactccaaa 
atgaaactga 
tggcatctga 
ctattaatta 
atatgttatt 
ttatattgat 
cataatttta 
gagaaaaaaa 
tcataaattc 
ctacgtcatc 
ccatgaatca 
tctctttccg 
atttattaac 
attagaacat 
gatccaattt 
aaaggaaagg 



atttataaaa 
ttatgattga 
tttcctaaaa 
catttttctc 
tactctcttt 
attggggcat 
aaagcattgg 
ttgattcagt 
atcttaaata 
ttttgtttag 
taaacatata 
taaggataaa 
aatattttag 
ctatatgtat 
tcttgccatt 
ctcttcacac 
tttcttttta 
gtgccaaaag 
gaaaaagcat 
cactatataa 
ttaactaaga 



atattttact 
aagagaggaa 
aaaacctaaa 
tttatctctt 
attttttctc 
gcctttatca 
caagtctcct 
atatctattg 
aataaaggat 
atgtatattc 
atcaatatta 
aaatatgata 
ccataaataa 
atttgaaatt 
tgtttttcat 
catttttact 
caacactttc 
atgcttattg 
tgttctaaca 
attgtgacga 
agcaatac-t-t 



ttatatgttt 
attgacagtg 
caaacatgta 
tctttatttt 
tctatccctc 
ctactctatc 
cccctcctca 
catgtgtaaa 
aaaatatttt 
gaataaatct 
aatatgatat 
aaaataaatt 
atgaccagca 
aaaaacagat 
gcaaacagaa 
agcaaacaag 
tttgaaatag 
aatagagtgc 
cgataatcct 
aagcaaaatg 



tttcacattt 
agtaataagt 
tctactctct 
tttatcatat 
tcttatttcc 
tcctccacta 
agtgatttcc 
agtctttcca 
tttttcttca 
aaatatatga 
ttttttatat 
ttaaatattt 
tattttacaa 
aatcgttaag 
agggacgaaa 
tctcaacaac 
tagtattttt 
acatttgtaa 
gtgaaggcgt 
aattcacata 



ttgttaaaca 
gatgagaaaa 
atttcatcta 
catttcacat 
actcatatat 
aatcatttaa 
aactcagcat 
caatacataa 
taaaattaaa 
taatgatttt 
aggttgtaca 
ttatatttac 
ccttagtaat 
ggaaggaatc 
aaccacctca 
tgaagccagc 
ttttcacatg 
tgtactacta 
taactccaaa 
gctgagagag 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2012 



<210> 4 

<211> 27 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic oligonucleotide 

<400> 4 

ggtccaatat ggaacgatga gttgata 

<210> 5 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 5 - 

cgcggatccg ctggaactag aagagagacc taaga 

<210> 6 

<211> 1408 

<212> DNA 

<213> Glycine max 



<400> 6 

aactaaaaaa 

aaaactccaa 

aaggtgatgt 

ttccccaaga 

taattttaac 

tcttgaatgc 

cgattttaca 

tactaatcta 

atatatgcat 

cctaaatgag 



agctctcaaa 
ctaaaataac 
gcctctagca 
cgtgtacgca 
agtgccgact 
agtgtcgagt 
tgtgtgaatc 
ttcctttcat 
tattaattat 
tctaactatt 



ttacattttg 
aaatagcaca 
gtctagctta 
gattaaacaa 
tttttctgta 
ttggaccatt 
agacaaaagt 
tttatatgct 
tgcagtatta 
gcatatgaat 



agttgtttca 
tgcaggtgca 
tgaggctcgc 
tggacaaaac 
aacaaaaggc 
tgagtacaaa 
gcatgcaatc 
ccacttttat 
tgctattggt 
caaatgaagg 



ggttccattg 
aacaacacgt 
tgcttatcaa 
ttcaatcgat 
cagaatcata 
gccaatattg 
acttgcaagt 
ataaaaaaat 
tttatggccc 
aagaatcatg 



ccttattgct 
tactctgatg 
cgattcatca 
tatagaataa 
tcgcacatca 
aatgattttt 
aaattaagga 
atacattatt 
tgctaaataa 
atctaaacct 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
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gagtacccaa 
gacgtataaa 
cttcttttaa 
ataagctgca 
tttgaaaaat 
cgtgattcaa 
gaattttttt 
ttgtgttagc 
ttttataatc 
gcccctgctg 
aaaaagttgg 
tatcagaaat 
gctagctcac 
tcaaatcata 



tgcaataaaa 
ttaatgcata 
ctttgaaata 
atttcacatt 
gtcctacttc 
gcagtaatga 
ttaaatggat 
acaaaactta 
caaaataatc 
gataatgaaa 
caaactgtca 
taatttaaac 
taatcactat 
aaattcttcc 



tgcgtcctat 
taggttattt 
agctgataaa 
ttaccaaacc 
acattcacat 
aagtcgaaat 
atatacaagt 
cagaaagaga 
aaattaaaga 
attcatatca 
ttccctgttg 
caaatatata 
atatacgaca 
accaagtt 



tacctaaact 
tgagaaaaga. 
aatacgcttt 
gaaacaagaa 
caaattaa-tt 
cagatagaat 
ttactatttt 
taaattttaa 
atattggcta 
aaacaataca 
gtttttaagc 
gctacgaggg 
tgctacaagt 



tcaaacacac 
aaacatcaaa 
aaatcaactg 
tggtaacagt 
acaactaaat 
atacacgttt 
atatataatg 
ataaagagaa 
gatagaccgg 
gaagttctag 
caaatcacaa 
aacttcttca 
gaagtgacca 



attgccatcg 
agctctaaaa 
tgtgctgtat 
gaggcaaaaa 
aa'ataaacat 
aacatcaatt 
aaaattcatt 
ttatatccaa 
ctttttcact 
tttaataata 
ttcaattacg 
gtcattacta 
tatcttaatt 



<210> 7 

<211> 898 

<212> DNA 

<213> Glycine max 



<400> 7 

tatatatgtg 

agaaaaaaga 

atcggtgacc 

tggataataa 

qctcagttat 

ttttgtttca 

gaattttttt 

attctttaat 

cactccatct 

tctggttgtg 

ttatttctag 

aacatagagt 

cacagcacac 

gtcaagcagc 

ataactataa 



agggtagagg 
aaagggtttg 
aaacctcttt 
aaaagaaaaa 
aaaaaggaaa 
actcaatatt 
tctcctttta 
aatgaatcat 
cttgttaatt 
tattttgtta 
gatcatgcat 
atcctttcca 
ataagtggat 
aagccctccc 
atagccctaa 



gtatcacatg 
caactaaaaa 
ttgataccat 
tgatatttaa 
acaaaacaac 
tagcttttaa 
ttttatttgc 
tagtcttaca 
tgatgggcat 
tctgttggta 
atacgcgcaa 
aaacaacgtc 
gatcaagatg 
gtggccaaat 
tctcactcca 



agctctggat 
cttgggaaag 
cttccattta 
tcta'aigttca 
gtacaactca 
taattaacta 
atgtatatta 
tcattacatg 
ccattactta 
ctctggagta 
accaaagaac 
caagttcata 
ggctcgtcca 
gtgcatgcat 
tgtttcatcg 



ttccataatg 
aacaaaggtt 
atctagaata 
aacaactcga 
atcagatttc 
aggtttttat 
ggagttgtcc 
atacacatgt 
tcaaccatcc 
gcatgcataa 
agagaccgat 
aaatagagac 
tgccacgcac 
acatgttaac 
tccaataata 



aaaaggaatc 
taatcttggg 
tgaaaataag 
ttagtccttt 
aatttgctta 
attatattta 
aatgataatt 
atgagatgtc 
gccatagtta 
cgctatattt 
gtaaagacaa 
gaaatgcaag 
accaacacac 
aagagcttgc 
tatatact 



660 
720 
780 
840 
900 
960 
1020 
1080' 
1140 
1200 
1260 
1320 
1380 
1408 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
- 660 
720 
780 
840 
898 



<210> 8 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic oligonucleotide 
<400> 8 

cgcggatcct atatatgtga gggtagaggg tatcac 

<210> 9 I 

<211> 44 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 

<400> 9 

gaattcgcgg ccgcagtata tatattattg gacgatgaaa catg 
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<210> 10 

<211> 690 

<212> DNA 

<213> Glycine max 



<400> 10 

tagcctaagt 

aaacaaatta 

gataaatagt 

atttggttga 

aaaaataagt 

gaaattgaaa 

aatccaggaa 

atttgcaata 

acctcacacc 

aagaatgtct 

cttccctata 

ttggtcctta 



acgtactcaa 
ataaaacact 
taatattttt 
aaatattaat 
gtacgtggtt 
gcgagtctaa 
gaaaagaaat 
gaaacactga 
atgaacttca 
caagctcagc 
aataaccacg 
aacactcatc 



aatgccaaca 
tacaacaccg 
aataattatt 
atg-tttaaat 
aacattagta 
tttttaaatt 
gaaaccatgc 
aacacctttc 
tgaggtgtag 
accctacttc 
cctcaggttc 
agtcatcacc 



aataaaaaaa 
gatttttttt 
taaaaagccg 
caacacaatc 
cagtaatata 
atgaacctgc 
atggtcccct 
tctttgtcac 
cacccaaggc 
tgtgacgttg 
tccgcttcac 



aagttgcttt 
aattaaaatg 
tatctactaa 
tatcaaaatt 
agaggaaaat 
atatataaaa 
cgtcatcacg 
ttaattgaga 
ttccatagcc 
tccctcattc 
aactcaaaca 



aataatgcca 
tgccatttag 
aatgattttt 
aaactaaaaa 
gagaaattaa 
ggaaagaaag 
agtttctgcc 
tgccgaagcc 
atgcatactg 
accttcctct 
ttctcctcca 



<210> 11 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 11 

cgcggatcct agcctaagta cgtactcaaa atgcca 

<210> 12 

<211> 41 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



synthetic oligonucleotide 



<400> 12 

gaattcgcgg ccgcggtgat gactgatgag tgtttaagga c 

<210> 13 

<211> 32 

<212> DNA 

<213> Artificial Sequence 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
690 



36 



41 



<220> 
<223> 



synthetic oligonucleotide 



<400> 13 

ttgcggccgc aaaccatggc tgctgctccc ag 

<210> 14 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



32 



4 



J 
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<400> 14 

aagcggccgc ttactgcgcc ttac 



24 



<210> 15 

<211> 34 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



synthetic oligonucleotide 



<400> 15 

atctagacct gcaggccaac tgcgtttggg gctc 



34 



<210> 


16 


<211> 


40 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


16 


cttttaactt cgcggccgct tgctattgat 


<210> 


17 


^211> 


38 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


17 


caatagcaag cggccgcgaa gttaaaagca 


<210> 


18 


<211> 


35 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


18 


aatctagacg tacgcaaagg caaagattta 


<210> 


19 


<211> 


36 


<212> 


DNA 


•<213> 


Artificial Sequence 



40 



38 



35 



<220> 
<223> 



synthetic oligonucleotide 



<:400> 19 

tttctagacg tacgtccctt cttatctttg atctcc 



36 



<210> 
<211> 



20 
34 



5 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 20 

gcggccgcag ttggatagaa tatatgtttg tgac 34 

<210> 21 

<211> 41 ' 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 21 

ctatccaact gcggccgcat ttcgcaccaa atcaatgaaa g 41 

<210> 22 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 22 

aatctagacg tacgtgaagg ttaaacatgg tgaatatg 38 

<210> 23 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 23 

atctagacgt acgtcctcga agagaaggg 29 

<210> 24 

<211> 22 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 

<400> 24 

ttctagacgt acggatataa tg 22 

<210> 25 

<211> 36 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 

6 
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<400> 25 

tttctagacg tacggtctca atagattaag aagttg 



36 



<210> 26 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



<400> 25 

gcggccgcga agagagatac taagagaatg ttg 



33 



<210> 27 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



<400> 27 

gtatctctct tcgcggccgc atttggcacc aaatcaatg 



39 



<210> 28 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



synthetic oligonucleotide 



<400> 28 

tttctagacg tacgtcaaaa aatttcattg taactc. 



36 



<210> 29 

<211> 37 

<212> DNA 

<213> Artificial Sequence 



<220>" 
<223> 



synthetic oligonucleotide 



<400> 29 

cgcggatcca tcttaggccc ttgattatat ggtgttt 



37 



<210> 30 

•<211> 43 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



synthetic oligonucleotide 



<400> 30 

gaattcgcgg ccgctgaagt attgcttctt agttaacctt tec 



43 
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<210> 31 

<211> 41 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> synthetic oligonucleotide 

<400> 31 

cgcggatcca actaaaaaaa gctctcaaat tacattttga g 41 

<210> 32 

<211> 44 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 

<400> 32 

gaattcgcgg ccgcaacttg gtggaagaat tttatgattt gaaa 44 

<210> 33 

<211> 1617 

<212> DNA 

<213> Mortierella ^Ipina 



<400> 33 

cgacactcct tccttcttct cacccgtcct agtccccttc aacccccctc tttgacaaag 60 

acaacaaacc atggctgctg ctcccagtgt gaggacgttt actcgggccg aggttttgaa 120 

tgccgaggct ctgaatgagg gcaagaagga tgccgaggca cccttcttga tgatcatcga 180 

caacaaggtg tacgatgtcc gcgagttcgt ccctgatcat cccggtggaa gtgtgattct 24 0 

cacgcacgtt ggcaaggacg gcactgacgt ctttgacact tttcaccccg aggctgcttg 300 

ggagactctt gccaactttt acgttggtga tattgacgag agcgaccgcg atatcaagaa 360 

tgatgacttt gcggccgagg tccgcaagct gcgtaccttg ttccagtctc ttggttacta 420 

cgattcttcc aaggcatact acgccttcaa ggtctcgttc aacctctgca tctggggttt 4 80 

gtcgacggtc attgtggcca agtggggcca gacctcgacc ctcgccaacg tgctctcggc 54 0 

tgcgcttttg ggtctgttct ggcagcagtg cggatggttg gctcacgact ttttgcatca 600 

ccaggtcttc caggaccgtt tctggggtga tcttttcggc gccttcttgg gaggtgtctg 660 

ccagggcttc tcgtcctcgt ggtggaagga caagcacaac actcaccacg ccgcccccaa 720 

cgtccacggc gaggatcccg acattgacac ccaccctctg ttgacctgga gtgagcatgc 780 

gttggagatg ttctcggatg tcccagatga ggagctgacc cgcatgtggt cgcgtttcat 840 

ggtcctgaac cagacctggt tttacttccc cattctctcg tttgcccgtc tctcctggtg 900 

cctccagtcc attctctttg tgctgcctaa cggtcaggcc cacaagccct cgggcgcgcg 960 

tgtgcccatc tcgttggtcg agcagctgtc gcttgcgatg cactggacct ggtacctcgc 1020 

caccatgttc ctgttcatca aggatcccgt caacatgctg gtgtactttt tggtgtcgca 1080 

ggcggtgtgc ggaaacttgt tggcgatcgt gttctcgctc aacpacaacg gtatgcctgt 1140 

gatctcgaag gaggaggcgg tcgatatgga tttcttcacg aagcagatca tcacgggtcg 1200 

tgatgtccac ccgggtctat ttgccaactg gttcacgggt ggattgaact atcagatcga 1260 

gcaccacttg ttcccttcga tgcctcgcca caacttttca aagatccagc ctgctgtcga 1320 

gaccctgtgc aaaaagtaca atgtccgata ccacaccacc ggtatgatcg agggaactgc 1380 

agaggtcttt agccgtctga acgaggtctc caaggctgcc tccaagatgg gtaaggcgca 14 40 

gtaaaaaaaa aaacaaggac gttttttttc gccagtgcct gtgcctgtgc ctgcttccct 1500 

tgtcaagtcg agcgtttctg gaaaggatcg ttcagtgcag tatcatcatt ctccttttac 1560 

cccccgctca tatctcattc atttctctta ttaaacaact tgttcccccc ttcaccg 1617 



<2ia> 34 

<211> 457 

<212> PRT 

<213> Mortierella alpina 



8 
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<400> 34 

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
1 5 10 15 

Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
20 25 30 

Leu Met He He Asp Asn Lys Val Tyr Asp Val Arg Glu. Phe Val Pro 
35 40 45 

Asp His Pro Gly Gly Ser Val He Leu Thr His Val Gly Lys Asp Gly 
50 55 60 

Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65 70 .75 80 

Ala Asn Phe Tyr Val Gly Asp He Asp Glu Ser Asp Arg Asp He Lys 
85 90 95 

Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gin 
100 105 110 

Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
115 120 125 

Ser Phe Asn Leu Cys He Trp Gly Leu Ser Thr Val He Val Ala Lys 
130 135 140 

Trp Gly Gin Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145 150 155 160 

Gly Leu Phe Trp Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 

His Gin Val Phe Gin Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
180 185 190 

Leu Gly Gly Val Cys Gin C31y Phe Ser Ser Ser Trp Trp Lys Asp Lys 
195 200 205 

His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
210 215 220 

He Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225 230 235 240 

Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
245 250 I 255 

Met Val Leu Asn Gin Thr Trp Phe Tyr Phe Pro He Leu Ser Phe Ala 
260 265 270 

Arg Leu Ser Trp Cys Leu Gin Ser He Leu Phe Val Leu Pro Asn Gly 
275 280 285 

Gin Ala His Lys Pro Ser Gly Ala Arg Val Pro He Ser Leu Val Glu 
290 295 300 
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Gin Leu Ser Leu Ala 
305 

Leu Phe lie Lys Asp 
325 



Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
310 315 320 

Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
330 335 



Gin Ala Val Cys Gly 
340 



Asn Leu Leu Ala lie Val Phe Ser Leu Asn His 
345 350 



ASn Gly Met Pro Val 
355 



lie Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
360 365 



Phe Thr Lys Gin He 

370 



He Thr Gly Arg Asp Val His Pro Gly Leu Phe 
375 380 



Ala Asn Trp Phe Thr 
385 



Gly Gly Leu Asn Tyr Gin He Glu His His Leu 
390 395 400 



Phe Pro Ser Met Pro 
405 



Arg His Asn Phe Ser Lys He Gin Pro Ala Val 
410 415 



Glu Thr Leu Cys Lys 
420 



Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
425 430 



He Glu Gly Thr Ala 
435 



Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
440 445 



Ala Ala Ser Lys Met 
450 



Gly Lys Ala Gin 
455 



<210> 35 

<211> 1362 

<212> DNA 

<213> Saprolegnia diclina 



<400> 35 

atggtccagg 

caagacaacg 

cacccgggcg 

gtcttccacc 

cagtcgacgg 

ttcattgcgt 

agcaagctct 

gccatttgcc 

ctcttttacc 

aaccacttgt 

gtgcagtggt 

cccgagatcg 

ctcaagatgg 

gcgtacctgt 

atgtacgcct 

ctcgagcgcg 

aacatgtcgc 

ttcctcgcga 

aagcccgatt 

atcgactggt 

ccccggcaca 

atcccatacc 

cgcatctcga 



ggcaaaaggc 
cgtggatcgt 
gcgtcgtcat 
cgagctcggc 
cggccgtcga 
cgtaccgcaa 
actacctcta 
tccactttga 
agcagtgcgg 
ttggcgacct 
ggaagaacaa 
ccttccacgg 
cgcagcacgc 
actttcccat 
tctacaacgt 
ccggcctcct 
tgctccaagc 
tggtctttag 
tttggaagct 
tcatgggcgg 
acctcccggc 
acgagacggg 
tcgagttctt 



cgagaagatc 
gatccaccac 
gttcacgcag 
gctcaagctc 
cacgtcgatc 
gctgcgcctt 
caagtgcgcc 
ctcgacggcc 
ctggctcgcc 
cgtcggcgtc 
gcacaacacg 
cgacccggac 
ggtcgactcg 
cttgctcttt 
tgggcccggc 
cctctactac 
ggctgcgttc 
cgtcggccac 
gcaagtgctc 
cctcaactac 
gctcaacgtg 
cttcatcgcg 
caaggagttt 



tcgtgggcga 
aaggtgtacg 
gccggcgaag 
ctcgagcagt 
tcggacgagg 
gaagtcaagc 
tcgacgctga 
atgtacatgg 
catgactttc 
atggtcggca 
caccatgcga 
attgacacga 
cccgtcgggc 
gcgcgtatct 
ggcacctttg 
ggctggaacc 
ctctttgtga 
aacggcatgg 
tcgacgcgca 
cagatcgacc 
ctcgtcaagt 
ggcatggccg 
cccgccatgt 



ccatccgtga 
acatctcggc 
acgcgaccga 
actacgtcgg 
tcaagaagag 
gcctcggctt 
gcattgcgct 
tcgcggctgt 
tgcaccacca 
acctctggca 
tccccaacct 
tgccgattct 
tcttcttcat 
cgtgggtgat 
acaaggtcca 
tcggccttgt 
gccaggcgtc 
aggtctttga 
acgtgacgtc 
accacttgtt 
cgctctgcaa 
aggtcgtcgt 
aa 



gcacaaccgc 
ctttgaggac 
tgcgttcgct 
cgacgtcgac 
ccagtcggac 
gtacgactcg 
tgtgtcggcg 
catccttggc 
agtgtttgag 
gggcttctcg 
ccacgcgacg 
cgcgtggtcg 
gcgctaccaa 
ccagtcggcc 
gtacccgctg 
gtacgcagcc 
gtgcggcctc 
caaggacagc 
gtcgctctgg 
cccgatggtg 
gcagtacgac 
gcacctcgag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1362 
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<210> 36 
<211> 453 
<212> PRT 

<213> Saprolegnia diclina 

<400> 36 

Met Val Gin Gly Gin Lys Ala Glu Lys lie Ser Trp Ala Thr lie Arg 
15 10 15 

Glu His Asn Arg Gin Asp Asn Ala Trp lie Val lie His His Lys Val 
20 25 30 

Tyr Asp He Ser Ala Phe Glu Asp His Pro Gly Gly Val Val Met Phe 
35 40 45 

Thr Gin Ala Gly Glu Asp Ala Thr Asp Ala Phe Ala Val Phe His Pro 
50 55 60 

Ser Ser Ala Leu Lys Leu Leu Glu Gin Tyr Tyr Val Gly Asp Val Asp 
65 70 75 80 

Gin Ser Thr Ala Ala Val Asp Thr Ser He Ser Asp Glu Val Lys Lys 

85 90 95 

Ser Gin Ser Asp Phe He Ala Ser Tyr Arg Lys Leu Arg Leu Glu Val 
100 105 110 

Lys Arg Leu Gly Leu Tyr Asp Ser Ser Lys Leu Tyr Tyr Leu Tyr Lys 
115 120 125 

Cys Ala Ser Thr Leu Ser He Ala Leu Val Ser Ala Ala He Cys Leu 
130 135 140 . 

His Phe Asp Ser Thr Ala Met Tyr Met Val Ala Ala Val He Leu Gly 
145 150 155 160 

Leu Phe Tyr Gin Gin Cys Gly Trp Leu Ala His Asp Phe Leu His His 

165 170 175 

Gin Val Phe Glu Asn His Leu Phe Gly Asp Leu Val Gly Val Met Val 
180 185 , 190 

Gly Asn Leu Trp Gin Gly Phe Ser Val Gin Tip Trp Lys Asn Lys His 
195 200 205 

Asn Thr His His Ala He Pro Asn Leu His Ala Thr Pro Glu He Ala 
210 215 220 

Phe His Gly Asp Pro Asp He Asp Thr Met Pro He! Leu Ala Trp Ser 
225 230 235 240 

Leu Lys Met Ala Gin His Ala Val Asp Ser Pro Val Gly Leu Phe Phe 
245 250 255 

Met Arg Tyr Gin Ala Tyr Leu Tyr Phe Pro He Leu Leu Phe Ala Arg 
260 265 270 

He Ser Trp Val He Gin Ser Ala Met Tyr Ala Phe Tyr Asn Val Gly 
275 280 285 
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Pro Gly Gly Thr Phe Asp Lys Val Gin Tyr Pro Leu Leu Glu Arg Ala 
290 295 300 

Gly Leu Leu Leu Tyr Tyr Gly Trp Asn Leu Gly Leu Val Tyr Ala Ala 
305 310 315 320 

Asn Met Ser Leu Leu Gin Ala Ala Ala Phe Leu Phe Val Ser Gin Ala 
325 330 335 

Ser Cys Gly Leu Phe Leu Ala Met Val Phe Ser Val Gly His Asn Gly 

340 345 350 

Met Glu Val Phe Asp Lys Asp Ser Lys Pro Asp Phe Trp Lys Leu Gin 
355 360 365 

Val Leu Ser Thr Arg Asn Val Thr Ser Ser Leu Trp lie Asp Trp Phe 
370 375 380 

Met Gly Gly Leu Asn Tyr Gin He Asp His His Leu Phe Pro Met Val 
385 390 395 400 

Pro Arg His Asn Leu Pro Ala Leu Asn Val Leu Val Lys Ser Leu Cys 
405 410 415 

Lys Gin Tyr Asp He Pro Tyr His Glu Thr Gly Phe He Ala Gly Met 
420 425 ' 430 

Ala Glu Val Val Val His Leu Glu Arg He Ser He Glu Phe Phe Lys 
435 440 445 

Glu Phe Pro Ala Met 
450 

<210> 37 
<211> 1413 
<212> DNA 

<213> Saprolegnia diclina 

<400> 37 

atggccccgc agacggagct ccgccagcgc cacgccgccg tcgccgagac gccggtggcc 60 

ggcaagaagg cctttacatg gcaggaggtc gcgcagcaca acacggcggc ctcggcctgg 120 

atcattatcc gcggcaaggt ctacgacgtg accgagtggg ccaacaagca ccccggcggc 180 

cgcgagatgg tgctgctgca cgccggtcgc gaggccaccg acacgttcga ctcgtaccac 240 

ccgttcagcg acaaggccga gtcgatcttg aacaagtatg agattggcac gttcacgggc 300 

ccgtccgagt ttccgacctt caagccggac acgggcttct acaaggagtg ccgcaagcgc 360 

gttggcgagt acttcaagaa gaacaacctc catccgcagg acggcttccc gggcctctgg 420 

cgcatgatgg tcgtgtttgc ggtcgccggc ctcgccttgt acggcatgca cttttcgact 480 

atctttgcgc tgcagctcgc ggccgcggcg ctctttggcg tctgccaggc gctgccgctg 540 

ctccacgtca tgcacgactc gtcgcacgcg tcgtacacca acatgccgtt cttccattac 600 

gtcgtcggcc gctttgccat ggactggttt gccggcggct cgatggtgtc atggctcaac 660 

cagcacgtcg tgggccacca catctacacg aacgtcgcgg gctcggaccc ggatcttccg 720 

gtcaacatgg acggcgacat ccgccgcatc gtgaaccgcc aggtgttcca gcccatgtac 780 

gcattccagc acatctacct tccgccgctc tatggcgtgc ttggcctcaa gttccgcatc 8 40 

caggacttca ccgacacgtt cggctcgcac acgaacggcc cgatccgcgt caacccgcac 900 

gcgctctcga cgtggatggc catgatcagc tccaagtcgt tctgggcctt ctaccgcgtg 960 

taccttccgc ttgccgtgct ccagatgccc atcaagacgt accttgcgat cttcttcctc 1020 

gccgagtttg tcacgggctg gtacctcgcg ttcaacttcc aagtaagcca tgtctcgacc 1080 

gagtgcggct acccatgcgg cgacgaggcc aagatggcgc tccaggacga gtgggcagtc 1140 

tcgcaggtca agacgtcggt cgactacgcc catggctcgt ggatgacgac gttccttgcc 1200 

ggcgcgctca actaccaggt cgtgcaccac ttgttcccca gcgtgtcgca gtaccactac 12 60 

ccggcgatcg cgcccatcat cgtcgacgtc tgcaaggagt acaacatcaa gtacgccatc 1320 

12 
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ttgccggact ttacggcggc gttcgttgcc cacttgaagc acctccgcaa catgggccag 1380 
cagggcatcg ccgccacgat ccacatgggc taa 1413 

<210> 38 
<211> 470 
<212> PRT 

<213> Saprolegnia diclina 
<400> 38 

Met Ala Pro Gin Thr Glu Leu Arg Gin Arg His Ala Ala Val Ala Glu 
15 10 15 

Thr Pro Val Ala Gly Lys Lys Ala Phe Thr Trp Gin Glu Val Ala Gin 
20 25 30 

His Asn Thr Ala Ala Ser Ala Trp lie He He Arg Gly Lys Val Tyr 
35 40 45 

Asp Val Thr Glu Trp Ala Asn Lys His Pro Gly Gly Arg Glu Met Val 
50 55 60 

Leu Leu His Ala Gly Arg Glu Ala Thr Asp Thr Phe Asp Ser Tyr His 
65 70 75 80 

Pro Phe Ser Asp Lys Ala Glu Ser He Leu Asn Lys Tyr Glu He Gly 
85 . 90 95 

Thr Phe Thr Gly Pro Ser Glu Phe Pro Thr Phe Lys Pro Asp Thr Gly 
100 105 110 

Phe Tyr Lys Glu Cys Arg Lys Arg Val Gly Glu Tyr Phe Lys Lys Asn 
115 120 125 

Asn Leu His Pro Gin Asp Gly Phe Pro Gly Leu Trp Arg Met Met Val 
130 135 140 

Val Phe Ala Val Ala Gly Leu Ala Leu Tyr Gly Met His Phe Ser Thr 
145 150 155 160 

He Phe Ala Leu Gin Leu Ala Ala Ala Ala Leu Phe Gly Val Cys Gin 
165 170 175 

Ala Leu Pro Leu Leu His Val Met His Asp Ser Ser His Ala Ser Tyr 
180 185 190 

Thr Asn Met Pro Phe Phe His Tyr Val Val Gly Arg Phe Ala Met Asp 
195 200 205 

Trp Phe Ala Gly Gly Ser Met Val Ser Trp Leu AsJ Gin His Val Val 
210 215 220 

Gly His His He Tyr Thr Asn Val Ala Gly Ser Asp Pro Asp Leu Pro 
225 230 235 240 

Val Asn Met Asp Gly Asp He Arg Arg He Val Asn Arg Gin Val Phe 
245 250 255 

Gin Pro Met Tyr Ala Phe Gin His He Tyr Leu Pro Pro Leu Tyr Gly 
260 265 270 
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Val Leu Gly Leu Lys Phe Arg He Gin Asp Phe Thr 
275 280 

Ser His Thr Asn Gly Pro He Arg -Val Asn Pro His 
290 295 300 

Trp Met Ala Met He Ser Ser Lys Ser Phe Trp Ala 
305 310 315 

Tyr Leu Pro Leu Ala Val Leu Gin Met Pro He Lys 
325 330 

He Phe Phe Leu Ala Glu Phe Val Thr Gly Trp Tyr 
340 345 

Phe Gin Val Ser His Val Ser Thr Glu Cys Gly Tyr 
355 360 

Glu Ala Lys Met Ala Leu Gin Asp Glu Trp Ala Val 

370 375 380 

Thr Ser Val Asp Tyr Ala His Gly Ser Trp Met Thr 
385 390 395 

Gly Ala Leu Asn Tyr Gin Val Val His His Leu Phe 
405 410 



Asp Thr Phe Gly 
285 

Ala Leu Ser Thr 



Phe Tyr Arg Val 
320 

Thr Tyr Leu Ala 
335 

Leu Ala Phe Asn 
350 

Pro Cys Gly Asp 
365 

Ser Gin Val Lys 



Thr Phe Leu Ala 
400 

Pro Ser Val Ser 
415 



Gin Tyr His Tyr Pro Ala He Ala Pro He He 
420 425 

Glu Tyr Asn He Lys Tyr Ala He Leu Pro Asp 
435 440 

Val Ala His Leu Lys His Leu Arg Asn Met Gly 
450 455 

Ala Thr He His Met Gly 
465 470 



Val 



Phe 



Gin 
460 



Asp Val Cys Lys 
430 

Thr Ala Ala Phe 
445 

Gin Gly He Ala 



<210> 39 

<211> 819 

<212> DNA 

<213> Thraustochytriuici aureuia 



<400> 39 

atggcaaaca 

tggatggatg 

accatgctgg 

aggcagatgg 

ttcggacttt 

aaagtgtttg 

atcgtgtacg 

ctttgcaaga 

gccatctggt 

ctcaactctt 

gggttcgtga 

atgcttgtgc 

cagcttcttg 

agctatctta 



gcagcgtgtg 
gcgccaagcc 
cattcgaggt 
agaagccttt 
ccttgtacat 
gaaacgacat 
tgttctacgt 
agttcaacca 
gggctatcgc 
tcgtgcacac 
agccaatcaa 
agtccttgta 
gagtgtacat 
aaaagccaaa 



ggatgatgtg 
gtacgcactc 
gggatacatg 
tgagctcaag 
gtgcgtggtg 
ggagaagggc 
gtccaaggca 
ggtttccttc 
caagtacgct 
cgtcatgtac 
gccgtacatc 
cgactacctc 
gatcaccttg 
aaagagcaag 



gtgggccgcg 
accgatgggc 
gccatgctgc 
accatcaagc 
accatccgcc 
aacgagtctc 
tacgagttct 
ttgcatgtgt 
ccaggaggtg 
gcatactact 
accacccttc 
ttcccatgcg 
cttgccctct 
accaactaa 



tggagaccgg 
tcccgatgat 
tcttcggcat 
tcttgcacaa 
aggctatcct 
atgctcaggg 
tggataccgc 
accaccatgc 
atgcgtactt 
tcttctcctc 
agatgaccca 
actacccaca 
tcggcaactt 



cgtggaccag 
ggacgtgtcc 
cccgatcatg 
cttgtttctc 
tggaggctac 
catgtctcgc 
catcatgatc 
caccattttt 
ttcagtgatc 
ccaagggttc 
gttcatggca 
ggctcttgtg 
ttttgtgcag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
819 
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<210> 40 

<211> 272 

<212> PRT 

<213> Thraustochytrium aureum 

<400> 40 

Met Ala Asn Ser Ser Val Trp Asp Asp Val Val Gly Arg Val Glu.Thr 
1.5 10 15 

Gly Val Asp Gin Trp Met Asp Gly Ala Lys Pro Tyr Ala Leu Thr Asp 
20 25 30 

Gly Leu Pro Met Met Asp Val Ser Thr Met Leu Ala Phe Glu Val Gly 
35 40 45 

Tyr Met Ala Met Leu Leu Phe Gly lie Pro He Met Arg Gin Met Glu 
50 55 60 

Lys Pro Phe Glu Leu Lys Thr He Lys Leu Leu His Asn Leu Phe Leu 
65 70 75 : 80 

Phe Gly Leu Ser Leu Tyr Met Cys Val Val Thr He Arg Gin Ala He 
85 90 95 

Leu Gly Gly Tyr Lys Val Phe Gly Asn Asp Met Glu Lys Gly Asn Glu 
100 105 110 

Ser His Ala Gin Gly Met Ser Arg He Val Tyr Val Phe Tyr Val Ser 
115 120 125 

Lys Ala Tyr Glu Phe Leu Asp Thr Ala He Met He Leu Cys Lys Lys 
130 135 140 

Phe Asn Gin Val Ser Phe Leu His Val Tyr His His Ala Thr He Phe 
145 150 155 160 

Ala He Trp Trp Ala He Ala Lys Tyr JUa Pro Gly Gly Asp Ala Tyr 
165 170 175 

Phe Ser Val He Leu Asn Ser Phe Val His Thr Val Met Tyr Ala Tyr 
180 185 190 

Tyr Phe Phe Ser Ser Gin Gly Phe Gly Phe Val Lys Pro He Lys Pro 

195 200 205 

Tyr He Thr Thr Leu Gin Met Thr Gin Phe Met Ala Met Leu Val Gin 
210 215 220 

Ser Leu Tyr Asp Tyr Leu Phe Pro Cys Asp Tyr Pro' Gin Ala Leu Val 
225 230 235 240 

Gin Leu Leu Gly Val Tyr Met He Thr Leu Leu Ala Leu Phe Gly Asn 
245 250 255 

Phe Phe Val Gin Ser Tyr Leu Lys Lys Pro Lys Lys Ser Lys Thr Asn 
260 265 270 

<210> 41 
<211> 1077 
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<212> DNA 

<213> Saprolegnia diclina 



<400> 41 

atgactgagg 

ccgaacgcgt 

ttcaacgcgt 

gataacgttc 

ttctggggct 

agcgtcaact 

tggcgcgtga 

ttttacccgc 

ggcggtgcgt 

gacccgtggg 

tgggccgcct 

ggcctctact 

ttgcaccaca 

ggcaacctct 

attggcacgc 

gccaccaagc 

atcacggcct 

gcgcagatct 



ataagacgaa 
gctttgagtc 
cggcctcggc 
tgctccacgc 
tcttcacggt 
ttatcatcgg 
cgcaccgcca 
accggtcggt 
ggtttgtcta 
acccgctcct 
tcttcgccgc 
actatgcgcc 
acgacgaagc 
cgagcgtcga 
accaggtcca 
actttgcggc 
tcttcaagac 
tcacgctcaa 



ggtcgagttc 
gaacctcggc 
ggcgctgctc 
gctcgtttgc 
cggccacgac 
ctgcatcatg 
ccaccacaag 
caaggacctc 
cttgaaggtc 
ccttcgccgc 
gtacgcgtac 
gctctttgtc 
gacgccgtgg 
ccgctcgtac 
ccacttgttc 
cgcgtacccg 
cgcgcacctc 
agagtcggcc 



ccgacgctca 
ctctcgctct 
tacgcggcgc 
gccacctaca 
tgcggccact 
cactctgcga 
aacacgggca 
caggacgtgc 
gggtatgccc 
gcgtcggccg 
ctcacatact 
tttgcttcgt 
tacggcgact 
ggcgcgttcg 
ccgatcattc 
cacctcgtgc 
tttgtcaact 
gcggccgcca 



cggagctcaa 
actacacggc 
gctcgacgcc 
tctacgtgca 
cggccttptc 
ttttgacgcc 
acattgataa 
gccaatgggt 
cgcgcacgat 
tcatcgtgtc 
cgctcggctt 
tcctcgtcat 
cggagtggac 
tggacaacct 
cgcactacaa 
gcaggaacga 
acggcgctgt 
aggccaagtc 



gcackcgalzc 


OU 


ccgcgcgatc 


120 


gttcattgcc 


ion 
loO 


gggcgtcatc 


240' 


gcgctaccac 


■5 ft A 


gttcgagagc 


360 


ggacgagatc 


420 


ctacacgctc 


480 


gagccacttt 


540 


gctcggcgtc 


600 


tgccgtcatg 


660 


tacgaccttc 


720 


gtacgtcaag 


780 


gagccaccac 


840 


gctcaacgaa 


900 


cgagcccatc 


960 


gcccgagacg 


1020 


ggactaa 


1077 



<210> 42 
<211> 358 
^212> PRT 

<213> Saprolegnia diclina 
<400> 42 

Met Thr Glu Asp Lys Thr Lys Val Glu Phe Pro Thr Leu Thr Glu Leu 
15 10 15 

Lys His Ser lie Pro Asn Ala Cys Phe Glu Ser Asn Leu Gly Leu Ser 
20 25 30 

Leu Tyr Tyr Thr Ala Arg Ala He Phe Asn Ala Ser Ala Ser Ala Ala 
35 40 45 

Leu Leu Tyr Ala Ala Arg Ser Thr Pro Phe He Ala Asp Asn Val Leu 
50 55 60 

Leu His Ala Leu Val Cys Ala Thr Tyr He Tyr Val Gin Gly Val He 
65 70 75 80 

Phe Trp Gly Phe Phe Thr Val Gly His Asp Cys Gly His Ser Ala Phe 
85 90 95 

Ser Arg Tyr His Ser Val Asn Phe He He Gly Cys He Met His Ser 
100 105 110 

Ala He Leu Thr Pro Phe Glu Ser Trp Arg Val Thr His Arg His His 
115 120 125 

His Lys Asn Thr Gly Asn He Asp Lys Asp Glu He Phe Tyr Pro His 
130 135 140 

Arg Ser Val Lys Asp Leu Gin Asp Val Arg Gin Trp Val Tyr Thr Leu 
145 150 155 160 
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Gly Gly Ala Trp Phe Val Tyr Leu Lys Val Gly 
165 170 



Tyr Ala Pro Arg Thr 
175 



Met Ser His Phe Asp Pro Trp Asp Pro Leu Leu 
180 185 



Leu Arg Arg Ala Ser 
190 



Ala Val He Val Ser Leu Gly Val Trp Ala Ala 
195 200 



Phe Phe Ala Ala Tyr 
. 205 



Ala Tyr Leu Thr Tyr Ser Leu Gly Phe Ala Val 
210 215 



Met Gly Leu Tyr Tyr 
220 



Tyr Ala Pro Leu Phe Val Phe Ala Ser Phe Leu 
225 230 235 



Val He Thr Thr Phe 
240 



Leu His His Asn Asp Glu Ala Thr Pro Trp Tyr 
245 250 



Gly Asp Ser Glu Trp 
255 



Thr Tyr Val Lys Gly Asn Leu Ser Ser Val Asp 
260 265 



Arg Ser Tyr Gly Ala 

270 



Phe Val Asp Asn Leu Ser His His He Gly Thr 
275 280 



His Gin Val His His 
285 



Leu Phe Pro He He Pro His Tyr Lys Leu Asn 
290 295 



Glu Ala Thr Lys His 
300 



Phe Ala Ala Ala Tyr Pro His Leu Val Arg Arg 
305 310 315 



Asn Asp Glu Pro He 
320 



He Thr Ala Phe Phe Lys Thr Ala His Leu Phe 
325 330 



Val Asn Tyr Gly Ala 
335 



Val Pro Glu Thr Ala Gin He Phe Thr Leu Lys 
340 345 



Glu Ser Ala Ala Ala 
350 



Ala Lys Ala Lys Ser Asp 
355 

<210> 43 
<211> 957 
<212> DNA 

<213> Mortierella alpina 
<400> 43 

atggagtcga ttgcgccatt cctcccatca aagatgccgc aagatctgtt tatggacctt 60 

gccaccgcta tcggtgtccg ggccgcgccc tatgtcgatc ctctcgaggc cgcgctggtg 120 

gcccaggccg agaagtacat ccccacgatt gtccatcaca cgcgtgggtt cctggtcgcg 180 

gtggagtcgc ctttggcccg tgagctgccg ttgatgaacc cgttccacgt gctgttgatc 240 

•gtgctcgctt atttggtcac ggtctttgtg ggcatgcaga tcatgaagaa ctttgagcgg 300 

ttcgaggtca agacgttttc gctcctgcac aacttttgtc tggtctcgat cagcgcctac 3 60 

atgtgcggtg ggatcctgta cgaggcttat caggccaact atggactgtt tgagaacgct 4 20 

gctgatcata ccttcaaggg tcttcctatg gccaagatga tctggctctt ctacttctcc 4 80 

aagatcatgg agtttgtcga caccatgatc atggtcctca agaagaacaa ccgccagatc 540 

tccttcttgc acgtttacca ccacagctcc atcttcacca tctggtggtt ggtcaccttt 600 

gttgcaccca acggtgaagc ctacttctct gctgcgttga actcgttcat ccatgtgatc 660 

atgtacggct actacttctt gtcggccttg ggcttcaagc aggtgtcgtt catcaagttc 720 

tacatcacgc gctcgcagat gacacagttc tgcatgatgt cggtccagtc ttcctgggac 780 

atgtacgcca tgaaggtcct tggccgcccc ggatacccct tcttcatcac ggctctgctt 840 
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tggttctaca tgtggaccat gctcggtctc ttctacaact tttacagaaa gaacgccaag 900 
ttggccaagc aggccaaggc cgacgctgcc aaggagaagg caaggaagtt gcagtaa 957 

<210> 44 
<211> 318 
<212> PRT 

<213> Mortierella alpina 
<400> 44 

Met Glu Ser lie Ala Pro Phe Leu Pro Ser Lys Met Pro Gin Asp Leu 
15 10 15 

Phe Met Asp Leu Ala Thr Ala lie Gly Val Arg Ala Ala Pro Tyr Val 
20 25 30 

Asp Pro Leu Glu Ala Ala Leu Val Ala Gin Ala Glu Lys Tyr lie Pro 
35 40 45 

Thr lie Val His His Thr Arg Gly Phe Leu Val Ala Val Glu Ser Pro 
50 55 60 

Leu Ala Arg Glu Leu Pro Leu Met Asn Pro Phe His Val Leu Leu lie 
65 70 75 80 

Val Leu Ala Tyr Leu Val Thr Val Phe Val Gly Met Gin lie Met Lys 
85 90 95 

Asn Phe Glu Arg Phe Glu Val Lys Thr Phe Ser Leu Leu His Asn Phe 
100 105 110 

Cys Leu Val Ser lie Ser Ala Tyr Met Cys Gly Gly lie Leu Tyr Glu 
115 ' 120 125 

Ala Tyr Gin Ala Asn Tyr Gly Leu Phe Glu Asn Ala Ala Asp His Thr 
130 135 140 

Phe Lys Gly Leu Pro Met JUa Lys Met He Trp Leu Phe Tyr Phe Ser 
145 150 155 160 

Lys He Met Glu Phe Val Asp Thr Met He Met Val Leu Lys Lys Asn 
165 170 175 

Asn Arg Gin He Ser Phe Leu His Val Tyr His His Ser Ser He Phe 
180 185 190 

Thr He Trp Trp Leu Val Thr Phe Val Ala Pro Asn | Gly Glu Ala Tyr 
195 200 205 

Phe Ser Ala Ala Leu Asn Ser Phe He His Val He Met Tyr Gly Tyr 
210 215 220 

Tyr Phe Leu Ser Ala Leu Gly Phe Lys Gin Val Ser Phe He Lys Phe 
225 230 235 240 

Tyr He Thr Arg Ser Gin Met Thr Gin Phe Cys Met Met Ser Val Gin 
245 250 255 

Ser Ser Trp Asp Met Tyr Ala Met Lys Val Leu Gly Arg Pro Gly Tyr 
260 265 270 
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Pro Phe Phe He Thr Ala Leu Leu Trp Phe Tyr Met Trp Thr Met Leu 
275 280 285 

Gly Leu Phe Tyr Asn Phe Tyr Arg Lys Asn Ala Lys Leu Ala Lys Gin 
290 295 300 

Ala Lys Ala Asp Ala Ala Lys Glu Lys Ala Afg Lys Leu Gin 
305 310 315 

<210> 45 

<211> 1483 

<212> DNA 

<213> Mortierella alpina 



<400> 45 

gcttcctcca 

gggaacggac 

cgacctactc 

tcctggtgga 

gatgtatcac 

ggtctcgaat 

gagagtcgag 

gggacgatac 

gcctttcgtt 

gtgcgcacaa 

ccccactgtc 

ggtgtggatg 

tcccgacgtg 

tgtcaaccac 

ggtgcgcatt 

caatcccatc 

gtatcgcctg 

ggtcgcggac 

tgaggaagtt 

tatgcaggtc 

tggcagcttg 

tcccgatatt 

caaggatacg 

ccgtcccaag 

caagaatggc 



gttcatcctc 
caaggaaaaa 
ttggccatcc 
gtggacactc 
gcgtttgggg 
gagctgccca 
ggctacttta 
gctcttatct 
gtcgaacgca 
gtcggactca 
tggaagattc 
taccaacata 
tcgacgtctg 
atcaaccagc 
caggacatca 
tcgacatggc 
attgttcccc 
atggtgtcgt 
cagtggccgt 
gagactacgc 
aactaccagg 
ctggccatca 
ttttggcaag 
gaagagtaga 
aaaaggagat 



catttcgcca 
ccttcacctg 
gcggcagggt 
tcctgctcgg 
ctgcagatgc 
tcttcccgga 
cggatcggaa 
ttggatcctt 
catggcttca 
accctcttca 
tgggagccac 
tgctcggcca 
agcccgatgt 
acatgtttgt 
acattttgta 
acactgtgat 
tgcagtatct 
cttactggct 
tgcctgacga 
aggattacgc 
ctgtgcacca 
tcaagaacac 
catttgcttc 
agaaaaaaag 
caagtggaca 



cctgcattct 
ggaagagctg 
gtacgatgtc 
agctggccga 
cattatgaag 
gccaacggtg 
cattgatccc 
gatcgcttcc 
ggtg'gtgttt 
tgatgcgtct 
gcacgacttt 
tcacccctac 
tcgtcgtatc 
tcctttcctg 
ctttgtcaag 
gttctggggc 
gcccctgggc 
ggcgctgacc 
gaacgggatc 
abacgattcg 
tctgttcccc 
ctgcagcgag 
acatttggag 
cgccgaatga 
ttctctatga 



ttacgaccgt 
gcggcccata 
acaaagttct 
gatgttactc 
aagtactatg 
ttccacaaaa 
aagaatagac 
tactacgcgc 
gcaatcatca 
cacttttcag 
ttcaacggag 
accaacattg 
aagcccaacc 
tacggactgc 
accaatgacg 
ggcaaggctt 
aaggtgctgc 
ttccaggcga 
atccaaaagg 
cacctctgga 
aacgtgtcgc 
tacaaggttc 
cacttgcgtg 
agtattgccc 
aga 



taagcaagat 
acaccaagga 
tgagccgcca 
cggtctttga 
tcggtacact 
ccatcaagac 
cagagatctg 
agctctttgt 
tgggatttgc 
tgacccacaa 
catcgtacct 
ctggagcaga 
aaaagtggtt 
tggcgttcaa 
ctattcgtgt 
tctttgtctg 
tcttgttcac 
accacgttgt 
actgggcagc 
ccagcatcac 
agcaccatta 
cataccttgt 
ttcttggact 
cctttttctc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
. 960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1483 



<210> 46 

<211> 446 

<212> PRT 

<213> Mortierella alpina 

<400> 46 I 

Met Gly Thr Asp Gin Gly Lys Thr Phe Thr Trp Glu! Glu Leu Ala Ala 
• 1 5 10 15 

His Asn Thr Lys Asp Asp Leu Leu Leu Ala He Arg Gly Arg Val Tyr 
20 25 30 

Asp Val Thr Lys Phe Leu Ser Arg His Pro Gly Gly Val Asp Thr Leu 
35 40 45 

Leu Leu Gly Ala Gly Arg Asp Val Thr Pro Val Phe Glu Met Tyr His 
50 55 60 
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Ala Phe Gly Ala Ala Asp Ala lie Met Lys Lys Tyr Tyr Val Gly Thr 
65 70 75 80 

Leu Val Ser Asn Glu Leu Pro lie Phe Pro Glu Pro Thr Val Phe His 
85 90 95 

Lys Thr lie Lys Thr Arg Val Glu Gly Tyr Phe Thr Asp Arg Asn lie 
100 105 110 

Asp Pro Lys Asn Arg Pro Glu lie Trp Gly Arg Tyr Ala Leu He Phe 
115 120 125 

Gly Ser Leu He Ala Ser Tyr Tyr Ala Gin Leu Phe Val Pro Phe Val 
130 135 140 

Val Glu Arg Thr Trp Leu Gin Val Val Phe Ala He He Met Gly Phe 
145 150 155 160 

Ala Cys Ala Gin Val Gly Leu Asn Pro Leu His Asp Ala Ser His Phe 
165 170 175 

Ser Val Thr His Asn Pro Thr Val Trp Lys He Leu Gly Ala Thr His 
180 185 190 

Asp Phe Phe Asn Gly Ala Ser Tyr Leu Val Trp Met Tyr Gin His Met 
195 200 205 

Leu Gly His His Pro Tyr Thr Asn He Ala Gly Ala Asp Pro Asp Val 
210 215 220 

Ser Thr Ser Glu Pro Asp Val Arg Arg He Lys Pro Asn Gin Lys Trp 
225 230 235 240 

Phe Val Asn His He Asn Gin His Met Phe Val Pro Phe Leu Tyr Gly 
245 250 255 

Leu Leu Ala Phe Lys Val Arg He Gin Asp He Asn He Leu Tyr Phe 
260 265 270 

Val Lys Thr Asn Asp Ala He Arg Val Asn Pro He Ser Thr Trp His 

275 280 285 

Thr Val Met Phe Trp Gly Gly Lys Ala Phe Phe Val Trp Tyr Arg Leu 
290 295 300 

He Val Pro Leu Gin Tyr Leu Pro Leu Gly Lys Val Leu Leu Leu Phe 
305 310 315 320 

Thr Val Ala Asp Met Val Ser Ser Tyr Trp Leu Ala Leu Thr Phe Gin 
325 330 335 

Ala Asn His Val Val Glu Glu Val Gin Trp Pro Leu Pro Asp Glu Asn 
340 345 350 

Gly He He Gin Lys Asp Trp Ala Ala Met Gin Val Glu Thr Thr Gin 

355 360 365 

Asp Tyr Ala His Asp Ser His Leu Trp Thr Ser He Thr Gly Ser Leu 
370 375 380 
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Asn Tyr Gin Ala Val His His Leu Phe Pro Asn Val Ser Gin His His 
385 390 395 400 

Tyr Pro Asp He Leu Ala He He Lys Asn Thr Cys Ser Glu Tyr Lys 
405 410 415 

Val Pro Tyr Leu Val Lys Asp Thr Phe Trp Gin Ala Phe Ala Ser His 
420 425 430 

Leu Glu His Leu Arg Val Leu Gly Leu Arg Pro Lys Glu Glu 
435 440 445 

<210> 47 

<211> 1350 

<212> DNA 

<213> Arabidopsis thaliana 



<400> 47 

ctctctctct 

gaccaacgca 

tttgatccga 

cactgttggg 

gtcgcggctt 

tgggccgccc 

gggagtttct 

atcctcgttc 

catgttgaaa 

cacagtactc 

ttgtgctaca 

gctccaagcg 

agtcttatcg 

ccgtacatta 

gatgagaagt 

acaattgata 

atccatcatc 

aaacatgtgt 

ttggtggaga 

attgtcttct 

aattaatctc 

tttcttgttg 

tattaatgta 



ctctcttctc 
ccaatgtgaa 
gtgcacaacc 
ttaagagtcc 
tggccatcgc 
aaggaacact 
cagacattcc 
cttaccatgg 
acgacgagtc 
ggatgctcag 
gaagtcctgg 
agagaaagct 
ctctatcttt 
tctttgtgat 
tgccttggta 
gagattacgg 
tcttcccaca 
tgggaagata 
gtttggtcgc 
acgagacaga 
catttgttta 
tttttaagtt 
tttacattac 



tctttctctc 
cggagatccc 
accgttcaag 
tttgagatca 
tgccgtgtat 
tttctgggcc 
tctactgaat 
ttggagaata 
atgggttccg 
atacactgtc 
aaaagaagga 
tattgcaact 
cgtcttcggt 
gtggttggat 
tagaggcaag 
aatck-ttaac 
aatccctcac 
ctacagagaa 
aagtattaag 
tccagatctc 
gctctattag 
aaaagtgtac 
aaggcgtaaa 



cccctctctc 
ggcgccggag 
atcggagata 
atgagttacg 
gttgatagct 
atctttgttc 
agtgtggttg 
agccaccgga 
ttaccagaaa 
cctctcccca 
tcacatttta 
tcaactactt 
ccactcgcgg 
gctgtcacgt 
gaatCrgagtt 
aacattcatc 
tatcacttgg 
ccaaagacgt 
aaagatcatt 
tacgtttacg 
gaataaacca 
tcgtgaaact 



cggcgatggt 
accggaagaa 
taagggcggc 
tcgtcagaga 
ggttcctttg 
tcggccacga 
gtcacattct 
cacaccacca 
gggtgtacaa 
tgctcgcata 
acccatacag 
gttggtccat 
ttcttaaagt 
atttgcatca 
atctacgtgg 
acgacattgg 
tcgacgccac 
caggagcaat 
acgtcagcga 
cttctgacaa 
gcccactttt 
cttttttttt 



tgttgctatg 
agaagaaagg 
gattcctaag 
cattatcgcc 
gcctctttat 
ctgtggacat 
tcattctttc 
gaaccatggc 
gaaattgccc 
tcctctctat 
tagtttattt 
aatgttcgtc 
ctacggtgta 
tcatggtcac 
aggattaaca 
aactcacgtg 
gaaagcagct 
accgatccac 
cactggtgat 
atctaaaatc 
aaaattttta 
tctttttttt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1350 



<210> 48 

<211> 386 

<212> PRT 

<213> Arabidopsis thaliana 



<400> 48 I 

Met Val Val Ala Met Asp Gin Arg Thr Asn Val Asn | Gly Asp Pro Gly 
•1 5 10 15 

Ala Gly Asp Arg Lys Lys Glu Glu Arg Phe Asp Pro Ser Ala Gin Pro 
20 25 30 

Pro Phe Lys He Gly Asp He Arg Ala Ala He Pro Lys His Cys Trp 
35 40 45 

Val Lys Ser Pro Leu Arg Ser Met Ser Tyr Val Val Arg Asp He He 
50 55 60 
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Ala Val Ala Ala Leu Ala lie Ala Ala Val Tyr Val Asp Ser Trp Phe 
65 70 75 80 

Leu Trp Pro Leu Tyr Trp Ala Ala Gin Gly Thr Leu Phe Trp Ala lie 
85 90 95 

Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser Asp lie -Pro 
100 105 110 

Leu Leu Asn Ser Val Val Gly His He Leu His Ser Phe He Leu Val 

115 120 125 

Pro Tyr His Gly Trp Arg He Ser His Arg Thr His His Gin Asn His 
130 135 140 

Gly His Val Glu Asn Asp Glu Ser Trp Val Pro Leu Pro Glu Arg Val 
145 150 155 160 

Tyr Lys Lys Leu Pro His Ser Thr Arg Met Leu Arg Tyr Thr Val Pro 
165 170 175 

Leu Pro Met Leu Ala Tyr Pro Leu Tyr Leu Cys Tyr Arg Ser Pro Gly 
180 185 190 

Lys Glu Gly Ser His Phe Asn Pro Tyr Ser Ser Leu Phe Ala Pro Ser 

195 200 205 

Glu Arg Lys Leu He Ala Thr Ser Thr Thr Cys Trp Ser He Met Phe 
210 215 220 

Val Ser Leu He Ala Leu Ser Phe Val Phe Gly Pro Leu Ala Val Leu 
225 230 235 . 240 

Lys Val Tyr Gly Val Pro Tyr He He Phe Val Met Trp Leu Asp Ala 
245 250 255 

Val Thr Tyr Leu His His His Gly His Asp Glu Lys Leu Pro Trp Tyr 
260 265 270 

Arg Gly Lys Glu Trp Ser Tyr Leu Arg Gly Gly Leu Thr Thr He Asp 
275 280 285 

Arg Asp Tyr Gly He Phe Asn Asn He His His Asp He Gly Thr His 
290 295 300 

Val He His His Leu Phe Pro Gin He Pro His Tyr His Leu Val Asp 
305 310 315 ' 320 

Ala Thr Lys Ala Ala Lys His Val Leu Gly Arg Tyr Tyr Arg Glu Pro 
325 330 335 

Lys Thr Ser Gly Ala He Pro He His Leu Val Glu Ser Leu Val Ala 
340 345 350 

Ser He Lys Lys Asp His Tyr Val Ser Asp Thr Gly Asp He Val Phe 
355 360 365 

Tyr Glu Thr Asp Pro Asp Leu Tyr Val Tyr Ala Ser Asp Lys Ser Lys 
370 375 380 
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lie Asn 
385 

<210> 49 

<211> 834 

<212> DNA 

<213> Pavlova sp. 



<400> 49 

atgatgttgg 

gacattgaca 

tctgtggtct 

atgacggggc 

tggtgcgtgg 

aaggtggacc 

aacaagtacg 

gtctccttcc 

aagctcggca 

atgatgtatt 

ctcacgcagg 

gtaacgggtg 

ctggtgctct 

gcgaaaaagc 



ccgcaggcta 
accccaacgg 
atctcagcgg 
tgaaggacta 
tggcctttct 
ttgggcccaa 
tggagctcct 
tccacgtcta 
atggtggtga 
cctactacac 
cacagctcgt 
cctacccgtg 
tcacacgctt 
tcgcacagga 



tcttctagtg 
ggcctactcg 
tgtgtttggg 
catgttcact 
cctggaggtg 
ctccttcagg 
cgacacccta 
tcatcacgtg 
cgcatatttt 
catggcgctc 
gcagttttgc 
gcgaatttgc 
ctaccgccag 
ggcatcacag 



ctctcggccg 
acctcgtgga 
ctcacaaagt 
tacaatctct 
cggcgtgcgg 
ctcggcttcg 
tggatggtgc 
cttctgatgt 
ggcggtctca 
ctgggctggt 
atctgcctcg 
ttggtggagg 
gcctatgcca 
gccaaggcgg 



ctcgccagag 
ctggcctgcc 
acttcgagaa 
accaggtgat 
gcatgtcact 
tcacgtgggt 
tgcgcaagaa 
gggcctggtt 
tgaactcgat 
catgcccctg 
cccactccac 
tgtgggtgat 
aggaggcgaa 
tcaaggcgga 



cttccagcag 
cattgtgatg 
ccggaagccc 
catcaacgtg 
catcggcaat 
gcactacaac 
gacgcagcag 
cgttgtcgtc 
catccacgtg 
gaagcgctac 
atgggcggca 
ggtgtccatg 
ggccaaggag 
gtaa 



<210> 50 

<211> 277 

<212> PRT 

<213> Pavlova sp. 

<400> 50 

Met Met Leu Ala Ala Gly Tyr Leu Leu Val Leu Ser Ala Ala Arg Gin 
1 5 10 . 15 

Ser Phe Gin Gin Asp lie Asp Asn Pro Asn Gly Ala Tyr Ser Thr Ser 
20 25 30 

Trp Thr Gly Leu Pro lie Val Met Ser Val Val Tyr Leu Ser Gly Val 
35 40 45 

Phe Gly Leu Thr Lys Tyr Phe Glu Asn Arg Lys Pro Met Thr Gly Leu 
50 55 60 

Lys Asp Tyr Met The Thr Tyr Asn Leu Tyr Gin Val lie lie Asn Val 
65 70 75 80 

Trp Cys Val Val Ala Phe Leu Leu Glu Val Arg Arg Ala Gly Met Ser 
85 90 95 

Leu He Gly Asn Lys Val Asp Leu Gly Pro Asn SeJ Phe Arg Leu Gly 
100 105 110 

Phe Val Thr Trp Val His Tyr Asn Asn Lys Tyr Val Glu Leu Leu Asp 
115 120 125 

Thr Leu Trp Met Val Leu Arg Lys Lys Thr Gin Gin Val Ser Phe Leu 
130 135 140 

His Val Tyr His His Val Leu Leu Met Trp Ala Trp Phe Val Val Val 
145 150 155 160 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
834 



23 



WO 2004/071467 PCTAJS: 

Lys Leu Gly Asn Gly Gly Asp Ala Tyr Phe Gly Gly Leu Met Asn Ser 
165 170 175 

lie lie His Val Met Met Tyr Ser Tyr Tyr Thr Met Ala Leu Leu Gly 
180 185 190 

Trp Ser Cys Pro Trp Lys Arg Tyr Leu Thr Gin Ala Gin Leu Val Gin 



Tyr Pro Trp Arg lie Cys Leu Val Glu Val Trp Val Met Val Ser Met 
225 230 235 240 

Leu Val Leu Phe Thr Arg Phe Tyr Arg Gin Ala Tyr Ala Lys Glu Ala 

245 250 255 

Lys Ala Lys Glu Ala Lys Lys Leu Ala Gin Glu Ala Ser Gin Ala Lys 
260 265 270 

Ala Val Lys Ala Glu 
275 

<210> 51 
<211> 1530 
<212> DNA 

<213> Schizochytrium aggregatum 
<400> 51 

atgacggtgg gcggcgatga ggtgtacagc atggcgcagg tgcgcgacca caacaccccg 60 

gacgacgcct ggtgcgccat ccacggcgag gtgtacgagc tgaccaagtt cgcccgcacc 120 

caccccgggg gggacatcat cttgctggcc gccggcaagg aggccaccat cctgttcgag 180 

acgtaccacg tgcgccccat ctccgacgcg gtcctgcgca agtaccgcat cggcaagctc 240 

gccgccgccg gcaaggatga gccggccaac gacagcacct actacagctg ggacagcgac 300 

ttttacaagg tgctccgcca gcgtgtcgtg gcgcgcctcg aggagcgcaa gatcgcccgc 360 

cgcggcggcc ccgagatctg gatcaaggcc gccatcctcg tcagcggctt ctggtccatg 420 

ctctacctca tgtgcaccct ggacccgaac cgcggcgcca tcctggccgc catcgcgctg 480 

ggcatcgtcg ccgccttcgt cggcacgtgc attcagcacg acggcaacca cggcgcgttc 540 

gccttctctc cgttcatgaa caagctctct ggctggacgc tcgacatgat cggcgccagt 600 

gccatgacct gggagatgca gcacgtgctg ggccaccacc cgtacaccaa cctgatcgag 660 

atggagaacg gcacccaaaa ggtcacccac gccgacgtcg accccaagaa ggccgaccag 720 

gagagcgacc cggacgtctt cagcacctac cccatgctcc gtctgcaccc gtggcaccgc 780 

aagcgcttct accaccgctt ccagcacctg tacgcgccgc tgctcttcgg tttcatgacc 840 

atcaacaagg tgatcaccca ggatgtggga gttgtcctca gcaagcgtct gtttcagatc 900 

gatgccaact gccgttacgc cagcaagtcg tacgttgcgc gcttctggat catgaagctg 960 

ctcaccgtcc tctacatggt cgccctcccc gtgtacaccc agggccttgt cgacgggctc 1020 

aagctcttct tcatcgccca cttttcgtgc ggcgagctgc tggccaccat gttcatcgtc 1080 

aaccacatca tcgagggcgt ctcgtacgcc tccaaggact ctgtcaaggg caccatggcg 1140 

ccgccgcgca cggtgcacgg cgtgaccccg atgcatgaca cccgcgacgc gctcggcaag 1200 

gagaaggcag ccaccaagca cgtgccgctc aacgactggg ccgcggtcca gtgccagacc 1260 

tcggtcaact ggtcgatcgg ctcgtggttc tggaaccact tctccggcgg gctcaaccac 1320 

cagatcgagc accacctctt ccccggcctc acccacacca cctacgtgta cattcaggat 1380 

gtggtgcagg cgacgtgcgc cgagtacggg gtcccgtacc agtcggagca gagcctcttc 14 40 

tccgcctact tcaagatgct ctcccacctt cgggcgctcg gcaacgagcc gatgccctcg 1500 

tgggagaagg accaccccaa gtccaagtga 1530 



<210> 52 
<211> 509 
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<212> PRT 

<213> SchizochytriuAi aggregatum 

<400> 52 , ^ 

Met Thr Val Gly Gly Asp .Glu Val Tyr Ser Met Ala Gin Val Arg Asp 
1 5 10 IS 

His Asn Thr Pro Asp Asp Ala Trp Cys Ala lie His Gly Glu Val Tyr 
20 25 30 

Glu Leu Thr Lys Phe Ala Arg Thr His Pro Gly Gly Asp lie He Leu 
35 40 45 

Leu Ala Ala Gly Lys Glu Ala Thr He Leu Phe Glu Thr Tyr His Val 
50 55 60 

Arg Pro He Ser Asp Ala Val Leu Arg Lys Tyr Arg He Gly Lys Leu 
65 70 75 80 

Ala Ala Ala Gly Lys Asp Glu Pro Ala Asn Asp Ser Thr Tyr Tyr Ser 
85 90 95 

Trp Asp Ser Asp Phe Tyr Lys Val Leu Arg Gin Arg Val Val Ala Arg 
100 105 110 

Leu Glu Glu Arg Lys . He Ala Arg T^g Gly Gly Pro Glu He Trp He 
115 120 125 

Lys Ala Ala He Leu Val Ser Gly Phe Trp Ser Met Leu Tyr Leu Met 
130. 135 140 

Cys Thr Leu Asp Pro Asn Arg Gly Ala He Leu Ala Ala He Ala Leu 
145 • 150 155 • 160 

Gly He Val Ala Ala Phe Val Gly Thr Cys He Gin His Asp Gly Asn 
165 170 175 

His Gly Ala Phe Ala Phe Ser Pro Phe Met Asn Lys Leu Ser Gly Trp 
180 185 190 

Thr Leu Asp Met He Gly Ala Ser Ala Met Thr Trp Glu Met Gin His 

195 200 205 

Val Leu Gly His His Pro Tyr Thr Asn Leu He Glu Met Glu Asn Gly 
210 215 220 

Thr Gin Lys Val Thr His Ala Asp Val Asp Pro Lys Lys Ala Asp Gin 
225 230 235 | 240 

Glu Ser Asp Pro Asp Val Phe Ser Thr Tyr Pro Met Leu Arg Leu His 
245 250 255 

Pro Trp His Arg Lys Arg Phe Tyr His Arg Phe Gin His Leu Tyr Ala 
260 265 270 

Pro Leu Leu Phe Gly Phe Met Thr He Asn Lys Val He Thr Gin Asp 

275 280 285 

Val Gly Val Val Leu Ser Lys Arg Leu Phe Gin He Asp Ala Asn Cys 
290 295 300 
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Arg Tyr Ala Ser Lys Ser Tyr Val Ala Arg Phe Trp He Met Lys Leu 
305 310 315 320 

Leu Thr Val Leu Tyr Met Val Ala Leu Pro Val Tyr Thr Gin Gly Leu 
325 330 335 

Val Asp Gly Leu Lys Leu Phe Phe He Ala His Phe Ser Cys Gly Glu 

340 345 .350 

Leu Leu Ala Thr Met Phe He Val Asn His He He Glu Gly Val Ser 
355 360 365 

Tyr Ala Ser Lys Asp Ser Val Lys Gly Thr Met Ala Pro Pro Arg Thr 
370 375 380 

Val His Gly Val Thr Pro Met His Asp Thr Arg Asp Ala Leu Gly Lys 

385 390 395 400 

Glu Lys Ala Ala Thr Lys His Val Pro Leu Asn Asp Trp Ala Ala Val 
405 410 415 

Gin Cys Gin Thr Ser Val Asn Trp Ser He Gly Ser Trp Phe Trp Asn 

420 425 430 

His Phe Ser Gly Gly Leu Asn His Gin He Glu His His Leu Phe Pro 
435 440 445 

Gly Leu Thr His Thr Thr Tyr Val Tyr He Gin Asp Val Val Gin Ala 
450 455 460 

Thr Cys Ala Glu Tyr Gly Val Pro Tyr Gin Ser Glu Gin Ser Leu Phe 
465 470 475 480 

Ser Ala Tyr Phe Lys Met Leu Ser His Leu Arg Ala Leu Gly Asn Glu 
485 490 495 

Pro Met Pro Ser Trp Glu Lys Asp His Pro Lys Ser Lys 
500 505 



<210> 


53 


<211> 


27 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


53 


■gcggccgcat gactgaggat aagacga 


<210> 


54 


<211> 


27 


<212> 


DNA 


<213> 


Artificial Sequence 



27 



<220> 

<223> synthetic oligonucleotide 
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<400> 54 

gcggccgctt agtccgactt ggccttg 

<210> 55 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 55 

gcggccgcat ggagtcgatt gcgc 

<210> 56 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 56 

gcggccgctt actgcaactt cctt 

<210> 57 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 57 

gcggccgcat gggaacggac caag 

<210> 58 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 58 

gcggccgcct actcttcctt ggga 

<210> 59 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 59 

ttcctgcagg ctagcctaag tacgtactc 

<210> 60 

<211> 21 



PCTAJS2004/005758 
27 

24 
24 
24 
24 
29 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 
<400> 60 

aagcggccgc ggtgatgact g 21' 

<210> 61 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 
<400> 61 

Thr Arg Ala Ala lie Pro Lys His Cys Trp Val Lys 
15 10 

<210> 62 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<210> 63 

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 

<400> 63 

Ala Leu Phe Val Leu Gly His Asp Cys Gly His Gly Ser Phe Ser 
15 10 15 

<210> 64 

<211> 45 

<212> DNA I 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 
<220> 

<221> unsure 

<:222> (21).. (21) 

<223> y « c or t 

<220> 

<221> unsure 



<220> 
<223> 



synthetic oligonucleotide 



<400> 62 

atccgcgccg ccatccccaa gcactgctgg gtcaag 



36 
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<222> 
<223> 



(33) . . (33) 
y «= c or t 



<400> 64 

gccctcttcg tcctcggcca ygactgcggc cayggctcgt tctcg 

<210> 65 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



45 



<220> 
<223> 



synthetic oligonucleotide 



<220> 

<221> unsure 

<222> (4).. (4) 

<223> r = a or g 

<220> 

<221> unsure 

<222> (10).. (10) 

<223> r « a or g 

<220> 

<221> unsure 

<222> (30) ,.(30) 

<223> r = a or g 

<220> 

<221> unsure 

<222> (31).. (31) 

<223> r « a or g ' 

<220> 

<221> unsure 

<222> (34).. (34) 

<223> r a or g 

<220> 

<221> unsure 

<222> (38).. (38) 

<223> r = a or g 

<220> 

<221> unsure 

<222> (39) -.(39) 

<223> y « c or t 

<220> 

<221> unsure 
<222> (43) . . (43) 

<223> r = a or g 



<400> 65 

gagrtggtar tgggggatct gggggaagar rtgrtggryg acrtg 



45 



<210> 
<211> 



66 
15 



29 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 



<400> 66 

Pro Tyr His Gly Trp Arg lie Ser His Arg Thr His His Gin Asn 
1 5 10 . 15 

<210> 67 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



<220> 

<221> unsure 

<222> (9).. (9) 

<223> y = c or t 

<220> 

<221> unsure 

<222> (27).. (27) 

<223> y - c or t 

<220> 

<221> unsure 

<222> (36) . . (36) 

<223> y - c or t 

<220> 

<221> unsure 

<222> (39) . . (39) 

<223> y = c or t 

■ 

<400> 67 

ccctaccayg gctggcgcat ctcgcaycgc acccaycayc agaac 45 



<210> 68 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



<220> 

<221> unsure 

<222> (7) . .(7) 

<'223> r = a or g 



<220> 

<221> unsure 

<222> (10) . . (10) 

<223> r = a or g 
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<220> 
<221> 
<222> 
<223> 



unsure 
(19) . . (19). 
r « a or g 
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<220> 

<221> unsure 

<222> (37).. (37) 

<223> r = a or g 

<400> 68 

gttctgrtgr tgggtccgrt gcgagatgcg ccagccrtgg taggg 4 5 

<210> 69 

<211> 12 

<2I2> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 



<220> 

<221> UNSURE 

<222> (5) . . (5) 

<223> Xaa = Asp or His 

<220> 

<221> UNSURE 

<222> (7).. (7) 

<223> Xaa = Asp or Tyr 



<400> 69 

Gly Ser His Phe Xaa Pro Xaa Ser Asp Leu Phe Val 
15 10 



<210> 70 

<211> 36 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic oligonucleotide 
<220> 

<221> unsure 

<222> (13).. (13) 

<223> s = c or g 

<220> 

<221> unsure 

<222> (19). .(19) 

<223> k = g or t 



<400> 70 

ggctcgcact tcsaccccka ctcggacctc ttcgtc 36 

<210> 71 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> synthetic oligonucleotide 
<220> 

<221> unsure 

<222> (18) . . (18) 

<223> m « a or c 



<220> 

<221> unsure 

<222> (24).. (24) 

<223> w = a or t 



<400> 71 

gacgaagagg tccgagtmgg ggtwgaagtg cgagcc 36 

<210> 72 

<211> 13 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 
<;220> 

<221> UNSURE 

<222> (3) . . (3) 

<223> Xaa = Tyr or Phe 



<220> 

<221> UNSURE 

<222> (4) . . (4) 

<223> Xaa = Leu or Val 



<220> 

<221> UNSURE 

<222> (11) . . (11) 

<223> Xaa » Leu or lie 



<400> 72 

Tip Ser Xaa Xaa Arg Gly Gly Leu Thr Thr Xaa Asp Arg 
15 10 



<210> 73 

<211> 39 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 



<220> 

<221> unsure 

<222> (9) . . (9) 

<223> k g or t 



<220> 

<221> unsure 
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<222> (30).- (30) 

<223> w « a or t ' 

<220> 

<221> unsure 

<222> (32).. (32) 

<223> s = c or g 

<400> 73 

gcgctggakg gtggtgaggc cgccgcggaw gsacgacca 39 

<210> 74 

<211> 15 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> consensus peptide 

<400> 74 

His His Asp lie Gly Thr His Val lie His His Leu Phe Pro Gin 
1 5 10 15 

<210> 75 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> synthetic oligonucleotide 
<220> 

<221> unsure 

<222> (13).. (13) 

<223> r - a or g 



<220> 

<221> unsure 

<222> (16) . . (16) 

<223> r = a or g 

<220> 

<221> unsure 

<222> (25j..(25) 

<223> r = a or g 



<220> 

<221> unsure I 

<222> (40).. (40) I 

<223> r « a or g 

<220> 

<221> unsure 

<222> (43) . . (43) 

<223> r = a or g 

<400> 75 

ctgggggaag agrtgrtgga tgacrtgggt gccgatgtcr tgrtg 
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<210> 76 

<211> 15 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> consensus peptide 
<220> 

<221> UNSURE 

<222> (2) . . (2) 

<223> Xaa «= Leu or Phe 

<220> 

<221> UNSURE 

<222> (5).. (5) 

<223> Xaa «= Gin or Lys 



<220> 

<221> UNSURE 

<222> (12) . . (12) 

<223> Xaa « Val or He 



<400> 76 

His Xaa Phe Pro Xaa He Pro His Tyr His Leu Xaa Glu Ala Thr 
15 10 15 



<210> 77 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<220> 

<221> unsure 

<222> (12) . . (12) 

<223> y - c or t 



<220> 

<221> unsure 

<222> (16) - . (16) 

<223> r - a or g 



<220> 

<221> unsure 

<222> (22) , . (22) 

<223> r - a or g 

<220> 

<221> unsure 

<222> (33).. (33) 

<223> k = g or t 



<220> 

<221> unsure 

<222> (42).. (42) 

<223> r = a or g 
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<220> 

<221> unsure ' 

<222> (43)..<43) 

<223> r - a or g 

<400> 77 

ggtggcctcg aygagrtggt artgggggat ctkggggaag arrtg 



<210> 
<211> 
<212> 
<213> 


78 
15 
PRT 

Artificial 


Sequence 


<220> 
<223> 


consensus 


peptide 


<220> 
<221> 
<222> 
<223> 


UNSURE 
(3) . . (3) 
Xaa » Ala 


or lie 


A A A A 
O H CM ro 
CM CM CM CM 
CM CM CM CM 
V V V V 


UNSURE 
(6) . . (6) 
Xaa = Leu 


or Phe 


<400> 


78 





His Val Xaa His His Xaa Phe Pro Gin He Pro His Tyr His Leu 
15 10 15 



<210> 


79 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Secjuence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


79 


tacgcgtacc tcacgtactc gctcg 


<210> 


80 


<211> 


27 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


80 


ttcttgcacc acaacgacga agcgacg 


<210> 


81 


<211> 


25 


<212> 


DNA 


<213> 


Artificial Sequence 



25 



<220> 

<223> synthetic oligonucleotide 



35 
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<400> 81 

ggagtggacg tacgtcaagg gcaac 



<210> 


82 


<211> 


26 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


82 


tcaagggcaa cctctcgagc gtcgac 


<210> 


83 


<211> 


31 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


83 


cccagtcacg acgttgtaaa acgacggcca 


<210> 


84 


<211> 


30 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


synthetic oligonucleotide 


<400> 


84 


agcggataac aat-ttcacac aggaaacagc 


<210> 


85 


<211> 


30 


<212> 


DNA 


<213> 


Artificial Sequence 



<220> 

<223> synthetic oligonucleotide 

<400> 85 

ggtaaaagat ctcgtccttg tcgatgttgc 

<210> 86 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 

<400> 86 

gtcaaagtgg ctcatcgtgc 

<210> 87 

<211> 26 

<212> DNA 

<213> Artificial Sequence 



PCTAJS2004/005758 
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30 
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<220> 

<223> synthetic oligonucleotide 

<400> 87 

cgagcgagta cgtgaggtac gcgtac 

<210> 88 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



26 



<220> 
<223> 



synthetic oligonucleotide 



<400> 88 

tcaacagaat tcatgaccga ggataagacg aaggtcgagt tcccg 

<210> 89 

<211> 45 

<212> DNA 

<213> Artificial Sequence 



45 



<220> 
<223> 



synthetic oligonucleotide 



<400> 89 

aaaagaaagc ttcgcttcct agtcttagtc cgacttggcc ttggc 

<210> 90 

<211> 3979 

<212> DNA 

<213> Glycine max 



45 



<400> 90 

ggccgcagat 

aactcacatc 

taaatattaa 

tgtataaaat 

tttttaattt 

aaaagcattc 

tgagtggtgt 

aqacaaata a _ 

tcaattggtt 

cccaatctaa 

aggggctttt 

atgctcccct 

tctctctgct 

tcctagattc 

tcctcaactg 

aatacgcgtt 

agcttatgag 

gactgagatg 

tctgtagttt 

agtataggtc 

ccagtttgtt 

ttttttatgg 

gtactaactc 

gcggccaccg 

aaacgcatta 

gctctcaaaa 



ttaggtgaca 
caaacataac 
tcattatttt 
tatattattc 
cttaaaaaat 
atcatttaat 
gggagtaggc 
aaagtatgca 
gctacattca 
gccgcggatg 
ccgtcattaa 
ctaaactcgt 
cttctcttct 
cgtggttcaa 
ggtccatcta 
ggactttcag 
aaataaaatc 
gggtttggtg 
tgttttgttt 
acaataggaa 
tgtatatatg 
ctgaaatttt 
taggcttgtt 
cttccagaac 
ctagatccac 
tcaaatgttc 



ctatagaata 
atggatatct 
taagatatta 
atgatttttc 
gttgcaagac 
acattaaaaa 
aacctggcat 
acaaacaaat 
attttcaact 
caaacggttg 
ctcacccctg 
atcgcttcag 
cttctacctc 
tttcggatct 
ggatccatgt 
atctagtcga 
actttttttt 
attacagaat 
tcttgtttct 
ttcaaacttt 
tttaaaaaat 
tgcatgtgtc 
gtgcagtttt 
cacccgattc 
cctccctctc 
catctccaaa 



tgcatcacta 
tccttaccaa 
attaagaaat 
atacatttga 
acttattaga 
atatttaata 
tgaaacgaga 
caaaatcaaa 
cagtcaacgg 
aatctaaccc 
ccacccggtt 
agttgagacc 
tcaaggtact 
tgcacttctg 
gaaactctac 
aatcatttca 
tatttcaaaa 
tttagcgaat 
catacattcc 
gagcagggga 
gaaacttttg 
tttgctctct 
tgaagtataa 
tcttcttcct 
tctcatcaaa 
ccccccacgg 

37 



gtaagctttg 
tcatactaat 
taaaagattt 
ttttgataat 
catagtcttg 
ctaacagtag 
gaaagagagt 
gggcaaaggc 
ttgagattca 
acaatccaat 
tccctataaa 
aagacacact 
tttcttctcc 
gtttgctttg 
tctttcttta 
taattgcctt 
taaaccttgg 
tttgtaattg 
ttaggcttca 
attaatccct 
ctttaaattc 
gttgtaaatt 
ccatgccaca 
cttcacaccc 
ccctcaccaa 
cggcgccctt 



ctctagatca 
tattttgggt 
tttaaaaaaa 
aaatatattt 
ttctgtttac 
aatcttcttg 
cagaaccaga 
tggggttggc 
ctctgacttc 
ctcgttactt 
ttggaactca 
cgttcatata 
ctctaccaaa 
ccttgctttt 
atatctgcgg 
tctttctttt 
gccttgtgct 
tacttgtttg 
attttattcg 
tccttcaaat 
tattataact 
tactgtttag 
caacacaatg 
caccttcccc 
acccaaccac 
caccaaggaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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gcgccgacca cggagccctt cgtgtcacgg ttcgcctccg gcgaacctcg caagggcgcg 1620 

gacatccttg tggaggcgct ggagaggcag ggcgtgacga cggtgttcgc gtaccccggc 1680 

ggtgcgtcga tggagatcca ccaggcgctc acgcgctccg ccgccatccg caacgtgctc 1740 

ccgcgccacg agcagggcgg cgtcttcgcc gccgaaggct acgcgcgttc ctccggcctc 1800 

cccggcgtct gcattgccac ctccggcccc ggcgccacca acctcgtgag cggcctcgcc 1860 

gacgctttaa tggacagcgt cccagtcgtc gccatcaccg gccaggtcgc ccgccggatg 1920 

atcggcaccg acgccttcca agaaaccccg atcgtggagg tgagcagatc catcacgaag 1980 

cacaactacc tcatcctcga cgtcgacgac atcccccgcg tcgtcgccga ggctttcttc 2040* 

gtcgccacct ccggccgccc cggtccggtc ctcatcgaca ttcccaaaga cgttcagcag 2100 

caactcgccg tgcctaattg ggacgagccc gttaacctcc ccggttaqct cgccaggctg 2160 

cccaggcccc ccgccgaggc ccaattggaa cacattgtca gactcatcat ggaggcccaa 2220 

aagcccgttc tctacgtcgg cggtggcagt ttgaattcca gtgctgaatt gaggcgcttt 2280 

gttgaactca ctggtattcc cgttgctagc actttaatgg gtcttggaac ttttcctatt 234 0 

ggtgatgaat attcccttca gatgctgggt atgcatggta ctgtttatgc taactatgct 2400 

gttgacaata gtgatttgtt gcttgccttt ggggtaaggt ttgatgaccg tgttactggg 2460 

aagcttgagg cttttgctag tagggctaag attgttcaca ttgatattga ttctgccgag 2520 

attgggaaga acaagcaggc gcacgtgtcg gtttgcgcgg atttgaagtt ggccttgaag 2580 

ggaattaata tgattttgga ggagaaagga gtggagggta agtttgatct tggaggttgg 2640 

agagaagaga ttaatgtgca gaaacacaag tttccattgg gttacaagac attccaggac 2700 

gcgatttctc cgcagcatgc tatcgaggtt cttgatgagt tgactaatgg agatgctatt 2760 

gttagtactg gggttgggca gcatcaaatg tgggctgcgc agttttacaa gtacaagaga 2820 

ccgaggcagt ggttgacctc agggggtctt ggagccatgg gttttggatt gcctgcggct 2880 

attggtgctg ctgttgctaa ccctggggct gttgtggttg acattgatgg ggatggtagt 2940 

ttcatcatga atgttcagga gttggccact ataa|gagtgg agaatctccc agttaagata 3000 

ttgttgttga acaatcagca tttgggtatg gtggttcagt tggaggatag gttctacaag 3060 

tccaatagag ctcacaccta tcttggagat ccgtctagcg agagcgagat attcccaaac 3120 

atgctcaagt ttgctgatgc ttgtgggata ccggcagcgc gagtgacgaa gaaggaagag 3180 

cttagagcgg caattcagag aatgttggac acccctggcc cctaccttct tgatgtcatt 324 0 

gtgccccatc aggagcatgt gttgccgatg attcccagta atggatcctt caaggatgtg 3300 

ataactgagg gtgatggtag aacgaggtac tgattgccta gaccaaatgt tccttgatgc 3360 

ttgttttgta caatatatat aagataatgc tgtcctagtt gcaggatttg gcctgtggtg 3420 

agcatcatag tctgtagtag ttttggtagc aagacatttt attttccttt tatttaactt 3480 

actacatgca gtagcatcta tctatctctg tagtctgata tctcctgttg tctgtattgt 354 0 

gccgttggat tttttgctgt agtgagactg aaaatgatgt gctagtaata atatttctgt 3600 

tagaaatcta agtagagaat ctgttgaaga agtcaaaagc taatggaatc aggttacata 3660 

tcaatgtttt tcttttttta gcggttggta gacgtgtaga ttcaacttct cttggagctc 3720 

acctaggcaa tcagtaaaat gcatattcct tttttaactt gccatttatt tacttttagt 3780 

ggaaattgtg accaatttgt tcatgtagaa cggatttgga ccattgcgtc cacaaaacgt 3840 

ctcttttgct cgatcttcac aaagcgatac cgaaatccag agatagtttt caaaagtcag 3900 

aaatggcaaa gttataaata gtaaaacaga atagatgctg taatcgactt caataacaag 3960 

tggcatcacg tttctagtt 397 9 

<210> 91 
<211> 17 
<212> DNA 

<213> Artificial Sequence 

<220> ' 
<223> synthetic oligonucleotide 

<400> 91 

tgcggccgca tgagccg X7 

<210> 92 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 
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<400> 92 

acgtacggta ccatctgcta atattttaaa tc 

<210> 93 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



32 



<400> 93 

taatacgact cactattagg 
<210> 94 
<211> 35 
<212> DNA 

<213> Artificial Sequence 



20 



<220> 

<223> synthetic oligonucleotide 

<400> 94 

tgcccatgat gttggccgca ggctatcttc tagtg 

<210> 95 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



35 



<220> 
<223> 



synthetic oligonucleotide 



<400> 95 

gctgtcaacg atacgctacg taacg 

<210> 96 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



25 



<220> 
<223> 



synthetic oligonucleotide 



<400> 96 

gccaattgga gcgagttcca atctc 

<210> 97 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic oligonucleotide 



25 



<400> 97 

gcgatatccg tttcttctga ccttcatc 



28 



<210> 
<211> 



98 
28 
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<212> DNA 

<213> Artificial Sequence 

<220> 

<223> synthetic oligonucleotide 
<400> 98 

ttctagacct gcaggatata atgagccg 
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28' 
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